The present invention relates to monocot genes encoding a mutant AHAS enzyme that is specifically 
resistant to imidazolinone herbicides. Exemplary of these genes are corn DNA sequences which encode an 
amino acid substitution at position 621 of the wild-type AHAS enzyme. The mutant gene can be used to 
transform other plants to herbicide resistance; in this regard, the invention also provides host cells and vectors 
containing the gene, which cells and vectors are useful in the transformation process. 
This invention relates to novel DNA sequences that encode novel variant forms of acetohydroxy acid 
synthase enzyme (hereinafter AHAS). The AHAS enzyme is a critical enzyme routinely produced in a 
variety of plants and a broad range of microorganisms. Normal AHAS function is inhibited by imidazolinone 
herbicides; however, new AHAS enzymes encoded by the mutant DNA sequences function normally 
catalytically even in the presence of imidazolinone herbicides and, therefore, confer herbicide resistance 
upon the plant or microorganism containing them. 

The novel DNA sequences are derived from corn and have a substitution of an amino acid at position 
621 of the normal AHAS sequence. This substitution in the AHAS gene sequence results in a fully functional 
enzyme, but renders the enzyme specifically resistant to inhibition by a variety of imidazolinone herbicides. 
The availability of these variant sequences provides a tool for transformation of different crop plants to 
imidazolinone herbicide resistance, as well as providing novel selectable markers for use in other types of 
genetic transformation experiments. 

BACKGROUND OF THE INVENTION 

The use of herbicides in agriculture is now widespread. Although there are a large number of available 
compounds which effectively destroy weeds, not all herbicides are capable of selectively targeting the 
undesirable plants over crop plants, as well as being non-toxic to animals. Often, it is necessary to settle 
for compounds which are simply less toxic to crop plants than to weeds. In order to overcome this problem, 
development of herbicide resistant crop plants has become a major focus of agricultural research. 

An important aspect of development of herbicide-resistance is an understanding of the herbicide target, 
and then manipulating the affected biochemical pathway in the crop plant so that the inhibitory effect is 
avoided while the plant retains normal biological function. One of the first discoveries of the biochemical 
mechanism of herbicides related to a series of structurally unrelated herbicide compounds, the 
imidazolinones, the sulfonylureas and the triazolopyrimidines. It is now known (Shaner et al., Plant Physiol. 76: 
545-546, 1984; U.S. Patent No. 4,761,373) that each of these herbicides inhibits plant growth by interference 
with an essential enzyme required for plant growth, acetohydroxyacid synthase (AHAS; also referred to as 
acetolactate synthase, or ALS). AHAS is required for the synthesis of the amino acids isoleucine, leucine 
and valine. 

The AHAS enzyme is known to be present throughout higher plants, as well as being found in a variety 
of microorganisms, such as the yeast Saccharomyces cerevisiae, and the enteric bacteria Escherichia coli 
and Salmonella typhimurium. The genetic basis for the production of normal AHAS in a number of these 
species has also been well characterized. For example, in both E. coli and S. typhimurium three isozymes 
of AHAS exist; two of these are sensitive to herbicides while a third is not. Each of these isozymes 
possesses one large and one small protein subunit, and they map to the ilvIH, ilvGM and ilvBN operons. In 
yeast, the single AHAS isozyme has been mapped to the ILV2 locus. In each case, sensitive and resistant 
forms have been identified and sequences of the various alleles have been determined (Friden et al., Nucl. 
Acids Res. 13: 3979-3993, 1985; Lawther et al., PNAS USA 78: 922-928, 1982; Squires et al., Nucl. Acids 
Res. 11: 5299-5313, 1983; Wek et al., Nucl. Acids Res. 13: 4011-4027, 1985; Falco and Dumas, Genetics 
109: 21-35, 1985; Falco et al., Nucl. Acids Res. 13: 4011-4027, 1985). 

In tobacco, AHAS function is encoded by two unlinked genes, SuRA and SuRB. There is substantial 
identity between the two genes, both at the nucleotide level and amino acid level in the mature protein, 
although the N-terminal, putative transit region differs more substantially (Lee et al., EMBO J. 7: 1241-1248, 
1988). Arabidopsis, on the other hand, has a single AHAS gene, which has also been completely sequenced 
(Mazur et al., Plant Physiol. 85: 1110-1117, 1987). Comparisons among sequences of the AHAS genes in 
higher plants indicate a high level of conservation of certain regions of the sequence; specifically, there are 
at least 10 regions of sequence conservation. It has previously been assumed that these conserved regions 
are critical to the function of the enzyme, and that retention of that function is dependent upon substantial 
sequence conservation. 

It has been recently reported (U.S. Patent No. 5,013,659) that mutants exhibiting herbicide resistance 
possess mutations in at least one amino acid in one or more of these conserved regions. In particular, 
substitution of certain amino acids for the wild type amino acid at these specific sites in the AHAS protein 
sequence has been shown to be tolerated, and indeed to result in herbicide resistance of the plant 
possessing this mutation, while retaining catalytic function. The mutations described therein encode either 
cross resistance for imidazolinones and sulfonylureas or sulfonylurea-specific resistance, but no 
imidazolinone-specific mutations were disclosed. These mutations have been shown to occur at both the 
SuRA and SuRB loci in tobacco; similar mutations have been isolated in Arabidopsis and yeast. 

Imidazolinone-specific resistance has been reported elsewhere in a number of plants. U.S. Patent No. 
4,761,373 generally described an altered AHAS as a basis of herbicide resistance in plants, and specifically 
disclosed certain imidazolinone resistant corn lines. Haughn et al. (Mol. Gen. Genet. 211: 266-271, 1988) 
disclosed the occurrence of a similar phenotype in Arabidopsis. Sathasivan et al. (Nucl. Acids Res. 18: 2188, 
1990) identified the imidazolinone-specific resistance in Arabidopsis as being based on a mutation at 
position 653 in the normal AHAS sequence. In accordance with the present invention, a gene encoding 
imidazolinone-specific resistance in a monocot has now been isolated and determined to be associated with 
a single amino acid substitution in a wild-type monocot AHAS amino acid sequence. 

SUMMARY OF THE INVENTION 

The present invention provides novel nucleic acid sequences encoding functional monocot AHAS 
enzymes insensitive to imidazolinone herbicides. The sequences in question comprise a mutation in the 
codon encoding the amino acid serine at position 621 in the corn (maize) AHAS sequence, or in the 
corresponding position in other monocot sequences. Other monocots, such as wheat, are also known to 
exhibit imidazolinone specific mutations (e.g., ATCC Nos. 40994-97). In corn, the wild type sequence has a 
serine at this position. In a preferred embodiment, the substitution is asparagine for serine, but alternate 
substitutions for serine include aspartic acid, glutamic acid, glutamine and tryptophan. Although the 
claimed sequences are originally derived from monocots, the novel sequences are useful in methods for 
producing imidazolinone resistant cells in any type of plant, said methods comprising transforming a target 
plant cell with one or more of the altered sequences provided herein. Alternatively, mutagenesis is utilized 
to create mutants in plant cells or seeds containing a nucleic acid sequence encoding an imidazolinone 
insensitive AHAS. In the case of mutant plant cells isolated in tissue culture, plants which possess the 
imidazolinone resistant or insensitive trait are then regenerated. The invention thus also encompasses plant 
cells, bacterial cells, fungal cells, plant tissue cultures, adult plants, and plant seeds that possess a mutant 
nucleic acid sequence and which express functional imidazolinone resistant AHAS enzymes. 

The availability of these novel herbicide resistant plants enables new methods of growing crop plants in 
the presence of imidazolinones. Instead of growing non-resistant plants, fields may be planted with the 
resistant plants produced by mutation or by transformation with the mutant sequences of the present 
invention, and the field routinely treated with imidazolinones, with no resulting damage to crop plants. 

The mutant nucleic acids of the present invention also provide novel selectable markers for use in 
transformation experiments. The nucleic acid sequence encoding a resistant AHAS is linked to a second 
gene prior to transfer to a host cell, and the entire construct transformed into the host. Putative transformed 
cells are then grown in culture in the presence of inhibitory amounts of herbicide; surviving cells will have a 
high probability of having successfully acquired the second gene of interest. Alternately, the resistant AHAS 
gene can be cotransformed on an independent plasmid with the gene of interest, whereby about 50% of all 
transformants can be expected to have received both genes. 

The following definitions should be understood to apply throughout the specification and claims. A 
"functional" or "normal" AHAS enzyme is one which is capable of catalyzing the first step in the pathway 
for synthesis of the essential amino acids isoleucine, leucine and valine. A "wild-type" AHAS sequence is a 
sequence present in an imidazolinone sensitive member of a given species. A "resistant" plant is one which 
produces a mutant but functional AHAS enzyme, and which is capable of reaching maturity when grown in 
the presence of normally inhibitory levels of imidazolinone. The term "resistant", as used herein, is also 
intended to encompass "tolerant" plants, i.e., those plants which phenotypically evidence adverse, but not 
lethal, reactions to the imidazolinone. 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1: AHAS enzyme activity in 10-day old maize seedlings (corn lines A619 or XI12) in the 
presence of imazethapyr (Pursuit™, A) or chlorsulfuron (Oust™, B). Herbicide resistant AHAS activity is 
calculated as percentage of AHAS activity in the absence of inhibitor. The standard error between 
experiments is 10%. 

Figure 2: Southern analysis of genomic clones in phage EMBL3. Phages 12-1A (from W22), 12-7A, 
18-8A, 12-11, and 12-17A (from XI12) are digested with XbaI or SalI, separated on a 1% agarose gel, 
transferred onto nitrocellulose and hybridized with an AHAS cDNA fragment as probe. 

Figure 3: Southern analysis of genomic DNA from corn lines XI12, XA17, QJ22, A188 and B73. The 
DNA is digested with XbaI, separated on a 1% agarose gel, transferred onto nitrocellulose and hybridized 
with an AHAS cDNA fragment as probe. 

Figure 4: Restriction map of plasmid pCD8A. The mutant AHAS gene from XI12 was subcloned as an 
8 kb PstI fragment into vector pKS(+). The location and orientation of the AHAS gene is indicated by an 
arrow. The restriction sites of PstI, XhoI, HindIII, XbaI and ClaI are represented by symbols. 

Figure 5: Nucleotide sequencing gel of the non-coding strand (A) and the double stranded DNA 
sequence (B) of AHAS clones W22/4-4, B73/10-4 and XI12/8A in the region of amino acids 614 to 633. The 
position of the cytosine to thymidine transition is indicated by an arrow. 

Figure 6: Nucleotide and deduced amino acid sequences of the XI12/8A mutant AHAS gene. 

Figure 7: Nucleotide sequence alignment of XI12/8A, B73/7-4 and W22/1A als2 genes. (*) marks the 
base change causing the mutation at position 621, (#) differences from the B73/7-4 sequence and (>) 
represents silent changes. 

Figure 8: Amino acid sequences and alignment of XI12/8A, B73/7-4 and W22/1A als2 genes. (*) marks 
the mutation at position 621, (#) marks differences from the B73/7-4 sequence, and (>) represents silent 
changes. 

DETAILED DESCRIPTION OF THE INVENTION 

The gene of the present invention is isolatable from maize line XI12 (seed deposited with the 
American Type Culture Collection as Accession Number 75051), and has been inserted into plasmid 
pXI12/8A (deposited with the American Type Culture Collection as Accession Number 68643). It is also 
isolatable from any other imidazolinone-specific herbicide resistant mutant, such as the corn line QJ22 
(deposited as a cell culture with the American Type Culture Collection as Accession Number 40129), or the 
various wheat plants (seed deposited with the American Type Culture Collection as Accession Numbers 40994, 
40995, 40996, or 40997). A genomic DNA library is created, for example, in phage EMBL-3 with DNA from 
one of the imidazolinone resistant mutants, preferably one which is homozygous for the resistance trait, and 
is screened with a nucleic acid probe comprising all or a part of a wild-type AHAS sequence. 

In maize, the AHAS gene is found at two loci, als1 and als2 (Burr and Burr, Trends in Genetics 7: 55-61, 
1991); the homology between the two loci is 95% at the nucleotide level. The mutation in XI12 is mapped to 
locus als2 on chromosome 5, whereas the nonmutant gene is mapped to locus als1 on chromosome 4 
(Newhouse et al., "Imidazolinone-resistant crops", in The Imidazolinone Herbicides, Shaner and O'Connor 
(Eds.), CRC Press, Boca Raton, FL, in press). Southern analysis identifies some clones containing the 
mutant als2 gene, and some containing the non-mutant als1 gene. Both types are subcloned into 
sequencing vectors, and sequenced by the dideoxy sequencing method. 

Sequencing and comparison of wild type and mutant AHAS genes shows a difference of a single 
nucleotide in the codon encoding the amino acid at position 621 (Figure 5). Specifically, the codon AGT 
encoding serine in the wild type is changed to AAT encoding asparagine in the mutant AHAS (Figure 8). 

The mutant AHAS gene is otherwise similar to the wild type gene, encoding a protein having 638 amino 
acids, the first 40 of which constitute a transit peptide which is thought to be cleaved during transport into 
the chloroplast in vivo. The sequence of the als1 non-mutant gene from XI12 appears to be identical to the 
als1 gene from B73. 
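
The single-nucleotide change described above can be checked with a short sketch. It assumes the standard genetic code and works only on the two codons named in the text (AGT and AAT); the helper names are illustrative and do not come from the specification. It also shows why the same change appears as a cytosine to thymidine transition when the non-coding strand is read, as in Figure 5.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def point_differences(a, b):
    """List (index, base_in_a, base_in_b) for every mismatching position."""
    return [(i, x, y) for i, (x, y) in enumerate(zip(a, b)) if x != y]


wild_type, mutant = "AGT", "AAT"  # codon 621, coding strand, as stated in the text

coding_change = point_differences(wild_type, mutant)
noncoding_change = point_differences(
    reverse_complement(wild_type), reverse_complement(mutant)
)

print(CODON_TABLE[wild_type], "->", CODON_TABLE[mutant])  # serine -> asparagine
print("coding strand:", coding_change)
print("non-coding strand:", noncoding_change)
```

Read on the coding strand the change is G to A at the second codon position; on the complementary strand the same event is C to T.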

The mutant genes of the present invention confer resistance to imidazolinone herbicides, but not to 
sulfonylurea herbicides. Types of herbicides to which resistance is conferred are described for example in 
U.S. Patent Nos. 4,188,487; 4,201,565; 4,221,586; 4,297,128; 4,554,013; 4,608,079; 4,638,068; 4,747,301; 
4,650,514; 4,698,092; 4,701,208; 4,709,036; 4,752,323; 4,772,311; and 4,798,619. 

It will be understood by those skilled in the art that the nucleic acid sequence depicted in Figure 6 is 
not the only sequence which can be used to confer imidazolinone-specific resistance. Also contemplated 
are those nucleic acid sequences which encode an identical protein but which, because of the degeneracy 
of the genetic code, possess a different nucleotide sequence. The invention also encompasses genes 
encoding AHAS sequences in which the aforestated mutation is present, but which also encode one or 
more silent amino acid changes in portions of the molecule not involved with resistance or catalytic 
function. Also contemplated are gene sequences from other imidazolinone resistant monocots which have a 
mutation in the corresponding region of the sequences. 
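
As a small illustration of the degeneracy referred to above, the following sketch lists every codon of the standard genetic code that encodes serine (the wild-type residue at position 621) and each of the substitute residues named in the Summary. The standard code is assumed, and the function name is hypothetical.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def codons_for(amino_acid):
    """Return all codons that encode the given one-letter amino acid."""
    return sorted(c for c, aa in CODON_TABLE.items() if aa == amino_acid)


# S = serine (wild type); N, D, E, Q, W = substitutions named in the text.
for residue in "SNDEQW":
    print(residue, codons_for(residue))
```

Any of the synonymous codons printed for a residue encodes the same protein at that position, which is the sense in which a different nucleotide sequence can encode an identical enzyme.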

For example, alterations in the gene sequence which result in the production of a chemically equivalent 
amino acid at a given site are contemplated; thus, a codon for the amino acid alanine, a hydrophobic amino 
acid, can readily be substituted by a codon encoding another hydrophobic residue, such as glycine, or may 
be substituted with a more hydrophobic residue such as valine, leucine, or isoleucine. Similarly, changes 
which result in substitution of one negatively charged residue for another, such as aspartic acid for glutamic 
acid, or one positively charged residue for another, such as lysine for arginine, can also be expected to 
produce a biologically equivalent product. The invention also encompasses chimaeric genes, in which the 
substituted portion of the corn AHAS gene is recombined with unaltered portions of the normal AHAS genes 
from other species. Thus, throughout the specification and claims, wherever the term "imidazolinone-specific 
resistant corn AHAS gene" is used, it is intended to cover each of these alternate embodiments as 
well as the sequence of Figure 6. 
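
The notion of a chemically equivalent substitution can be sketched as a lookup over coarse residue classes. The grouping below follows only the examples given in the preceding paragraph (alanine, glycine, valine, leucine and isoleucine as hydrophobic; aspartic and glutamic acid as negatively charged; lysine and arginine as positively charged). It is an illustrative assumption, not a complete classification, and the names are hypothetical.

```python
# Coarse residue classes (one-letter codes), taken from the examples in the text.
RESIDUE_CLASSES = {
    "hydrophobic": set("GAVLI"),
    "negative": set("DE"),
    "positive": set("KR"),
}


def residue_class(aa):
    """Return the class name for a residue, or None if it is not listed."""
    for name, members in RESIDUE_CLASSES.items():
        if aa in members:
            return name
    return None


def is_chemically_equivalent(original, replacement):
    """True when both residues fall in the same listed class."""
    cls = residue_class(original)
    return cls is not None and cls == residue_class(replacement)


print(is_chemically_equivalent("D", "E"))  # aspartic acid for glutamic acid
print(is_chemically_equivalent("K", "R"))  # lysine for arginine
print(is_chemically_equivalent("D", "K"))  # opposite charges
```

Residues outside the listed classes, such as the serine and asparagine at position 621, are deliberately left unclassified here.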

Isolated AHAS DNA sequences of the present invention are useful to transform target crop plants, and 
thereby confer imidazolinone resistance. A broad range of techniques currently exist for achieving direct or 
indirect transformation of higher plants with exogenous DNA, and any method by which the novel sequence 
can be incorporated into the host genome, and stably inherited by its progeny, is contemplated by the 
present invention. The imidazolinone specific resistance trait is inherited as a single dominant nuclear gene. 
The level of imidazolinone resistance is increased when the gene is present in a homozygous state; such 
corn plants, for example, have a resistance level of about 1,000 times that of a non-resistant plant. Plants 
heterozygous for the trait, however, have a resistance of about 50-500 times that of a non-resistant plant. 

Transformation of plant cells can be mediated by the use of vectors. A common method of achieving 
transformation is the use of Agrobacterium tumefaciens to introduce a foreign gene into the target plant cell. 
For example, the mutant AHAS sequence is inserted into a plasmid vector containing the flanking 
sequences in the Ti-plasmid T-DNA. The plasmid is then transformed into E. coli. A triparental mating 
among this strain, an Agrobacterium strain containing a disarmed Ti-plasmid containing the virulence 
functions needed to effect transfer of the AHAS containing T-DNA sequences into the target plant 
chromosome, and a second E. coli strain necessary to mobilize 
transfer of the AHAS construct from E. coli to Agrobacterium is carried out. A recombinant Agrobacterium 
strain, containing the necessary sequences for plant transformation, is used to infect leaf discs. Discs are 
grown on selection media and successfully transformed regenerants are identified. The recovered plants are 
resistant to the effects of herbicide when grown in its presence. Plant viruses also provide a possible means 
for transfer of exogenous DNA. 

Direct uptake of DNA by plant cells can also be employed. Typically, protoplasts of the target plant are placed 
in culture in the presence of the DNA to be transferred, and an agent which promotes the uptake of DNA by 
the protoplasts. Useful agents in this regard are polyethylene glycol or calcium phosphate. 

Alternatively, DNA uptake can be stimulated by electroporation. In this method, an electrical pulse is 
used to open temporary pores in a protoplast cell membrane, and DNA in the surrounding solution is then 
drawn into the cell through the pores. Similarly, microinjection can be employed to deliver the DNA directly 
into a cell, and preferably directly into the nucleus of the cell. 

In each of the foregoing techniques, transformation occurs in a plant cell in culture. Subsequent to the 
transformation event, plant cells must be regenerated to whole plants. Techniques for the regeneration of 
mature plants from callus or protoplast culture are now well known for a large number of different species 
(see, e.g., Handbook of Plant Cell Culture, Vols. 1-5, 1983-1989, Macmillan, N.Y.). Thus, once transformation 
has been achieved, it is within the knowledge in the art to regenerate mature plants from the transformed 
plant cells. 

Alternate methods are also now available which do not necessarily require the use of isolated cells, and 
therefore, plant regeneration techniques, to achieve transformation. These are generally referred to as 
"ballistic" or "particle acceleration" methods, in which DNA coated metal particles are propelled into plant 
cells by either a gunpowder charge (Klein et al., Nature 327: 70-73, 1987) or electrical discharge (EPO 270 
356). In this manner, plant cells in culture or plant reproductive organs or cells, e.g. pollen, can be stably 
transformed with the DNA sequence of interest. 

In certain dicots and monocots direct uptake of DNA is the preferred method of transformation. For 
example, in corn, the cell wall of cultured cells is digested in a buffer with one or more cell wall degrading 
enzymes, such as cellulase, hemicellulase and pectinase, to isolate viable protoplasts. The protoplasts are 
washed several times to remove the enzymes, and mixed with a plasmid vector containing the gene of 
interest. The cells can be transformed either with PEG (e.g. 20% PEG 4000) or by electroporation. The 
protoplasts are placed on a nitrocellulose filter and cultured on a medium with embedded corn cells 
functioning as feeder cultures. After 2-4 weeks, the cultures on the nitrocellulose filter are placed on a 
medium containing about 0.32 µM of the imidazolinone and maintained in the medium for 1-2 months. The 
nitrocellulose filters with the plant cells are transferred to fresh medium with herbicides and nurse cells 
every two weeks. The untransformed cells cease growing and die after a few weeks. 

The present invention can be applied to transformation of virtually any type of plant, both monocot and 
dicot. Among the crop plants for which transformation to herbicide resistance is contemplated are corn, 
wheat, rice, millet, oat, barley, sorghum, sunflower, sweet potato, alfalfa, sugar beet, Brassica species, 
tomato, pepper, soybean, tobacco, melon, squash, potato, peanut, pea, cotton, or cacao. The novel 
sequences may also be used to transform ornamental species, such as rose, and woody species, such as 
pine and poplar. 

The novel sequences of the invention also are useful as selectable markers in plant genetics studies. 
For example, in plant transformation, sequences encoding imidazolinone resistance could be linked to a 
gene of interest which is to be used to transform a target imidazolinone sensitive plant cell. The construct 
comprising both the gene of interest and the imidazolinone resistant sequence is introduced into the plant 
cell, and the plant cells are then grown in the presence of an inhibitory amount of an imidazolinone. 
Alternately, the second gene of interest can be cotransformed, on a separate plasmid, into the host cells. 
Plant cells surviving such treatment presumably have acquired the resistance gene as well as the gene of 
interest, and therefore, only transformants survive the selection process with the herbicide. Confirmation of 
successful transformation and expression of both genes can be achieved by Southern hybridization of 
genomic DNA, by PCR or by observation of the phenotypic expression of the genes. 

The invention is further illustrated by the following non-limiting examples. 

EXAMPLES 

1. Confirmation of Whole Plant Herbicide Resistance in XI12 

XI12 plants are treated with herbicides at 10 days to the V3 leaf stage (4-5 leaves, of which 3 have 
visible ligules). Imazethapyr is applied at rates of 2000, 500, 250, 125 and 62.5 g/ha. Chlorsulfuron is 
applied at 32, 16, 8, 4 and 2 g/ha. Plants are treated postemergence at a spray volume of 400 l/ha. After 
spraying, plants are placed in the greenhouse for further observation. 

XI12 plants are unaffected at all rates of imazethapyr treatment; however, no visible increased 
resistance to chlorsulfuron is noted. Thus, XI12 displays selective resistance to the imidazolinone at the 
whole plant level (see Figure 1). 

The resistance in XI12 is also shown to be inherited as a single dominant allele of a nuclear gene. 
Heterozygous resistant XI12 are selfed, and the selfed progeny segregate in the 3 resistant:1 susceptible 
ratio expected for a single dominant allele of a nuclear gene. In this study, the segregating seedlings are 
sprayed postemergence with lethal doses of imazethapyr (125 or 250 g/ha) following spraying protocols 
described above, to establish segregation for resistance. 
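
A segregation result of this kind is usually compared with the expected 3:1 ratio using a chi-square goodness-of-fit test. The sketch below shows the arithmetic; the seedling counts are hypothetical placeholders, since the specification does not report the numbers scored, and 3.841 is the standard critical value for one degree of freedom at the 5% level.

```python
def chi_square_3_to_1(resistant, susceptible):
    """Chi-square statistic for observed counts against a 3:1 expectation."""
    total = resistant + susceptible
    expected_r = 0.75 * total
    expected_s = 0.25 * total
    return ((resistant - expected_r) ** 2 / expected_r
            + (susceptible - expected_s) ** 2 / expected_s)


CRITICAL_0_05_DF1 = 3.841  # chi-square critical value, 1 degree of freedom, p = 0.05

# Hypothetical counts for illustration only; not data from the specification.
observed_resistant, observed_susceptible = 78, 22

statistic = chi_square_3_to_1(observed_resistant, observed_susceptible)
consistent = statistic < CRITICAL_0_05_DF1

print(f"chi-square = {statistic:.2f}; consistent with 3:1: {consistent}")
```

A statistic below the critical value means the observed counts do not differ significantly from the ratio expected for a single dominant allele.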

2. AHAS Extraction 

Seeds of XI12 are sown in soil in a greenhouse maintained at a day/night temperature of 80°F and a 15 
hour photoperiod. Plants are harvested two weeks after planting. The basal portion of the shoot is used for 
the extraction of AHAS. 5 g of the tissue is powdered in liquid nitrogen and then homogenized in AHAS 
assay buffer comprising 100 mM potassium phosphate buffer (pH 7.5) containing 10 mM pyruvate, 5 mM 
MgCl2, 5 mM EDTA, 100 µM FAD (flavin adenine dinucleotide), 1 mM valine, 1 mM leucine, 10% glycerol 
and 10 mM cysteine. The homogenate is centrifuged at 10,000 rpm for 10 minutes and 3 ml of the 
supernatant are applied onto an equilibrated Bio-Rad Econo-Desalting column (10 DG) and eluted with 4 ml 
AHAS assay buffer. 

AHAS activity is measured by estimation of the product, acetolactate, after conversion by 
decarboxylation in the presence of acid to acetoin. Standard reaction mixtures contain the enzyme in 50 mM 
potassium phosphate (pH 7.0) containing 100 mM sodium pyruvate, 10 mM MgCl2, 1 mM thiamine 
pyrophosphate, 10 µM FAD, and appropriate concentrations of different inhibitors. This mixture is incubated 
at 37°C for 1 to 3 hours depending upon the experiment. At the end of this incubation period, the reaction is 
stopped with the addition of H2SO4 to make a final concentration of 0.85% H2SO4 in the tube. The reaction 
product is allowed to decarboxylate at 60°C for 15 minutes. The acetoin formed is determined by incubating 
with creatine (0.17%) and 1-naphthol (1.7% in 4N NaOH). The absorption of the color complex formed is 
measured at 520 nm. 

AHAS activity from B73, A619, or other wild-type maize lines is highly sensitive to inhibition by 
imazethapyr (PURSUIT™) with an I50 of 1 µM (see Figure 1). Contrary to this observation, XI12 shows 
70-80% of enzyme activity at the highest concentrations (100 µM) of PURSUIT™ or ARSENAL™ (imazapyr), 
and about 70% in the presence of SCEPTER™ (imazaquin). This result shows a 100-fold increase in 
tolerance of AHAS activity from XI12 to imazethapyr as compared to the in vitro AHAS activity from A619. 
The sensitivity of AHAS activity from the two lines to sulfonylureas gives a different picture. In the presence of 
OUST™ (sulfometuron methyl), at 100 nM, AHAS activity of XI12 is only 15-20%. AHAS activity of A619 in 
the presence of OUST™ is 5-10%, and in the presence of PURSUIT™ is 15-20% (see Figure 1). 
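
The percentages quoted above are activities relative to an uninhibited control, and an I50 is the inhibitor concentration at which that relative activity falls to 50%. The following sketch shows one way such figures could be derived from absorbance readings at 520 nm. All readings and dose-response points are hypothetical, and log-linear interpolation between bracketing points is an assumed method, not one stated in the specification.

```python
import math


def percent_of_control(a520_treated, a520_control, a520_blank=0.0):
    """Activity with inhibitor as a percentage of the uninhibited control."""
    return 100.0 * (a520_treated - a520_blank) / (a520_control - a520_blank)


def estimate_i50(concs_um, percents):
    """Interpolate (on a log concentration scale) where activity crosses 50%.

    Returns None when activity stays above 50% over the tested range.
    """
    points = list(zip(concs_um, percents))
    for (c_lo, p_lo), (c_hi, p_hi) in zip(points, points[1:]):
        if p_lo >= 50.0 >= p_hi and p_lo != p_hi:
            frac = (p_lo - 50.0) / (p_lo - p_hi)
            log_c = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_c
    return None


# Hypothetical dose-response values (percent of control), for illustration only.
concs_um = [0.1, 1.0, 10.0, 100.0]
sensitive_line = [90.0, 50.0, 25.0, 15.0]
resistant_line = [100.0, 95.0, 85.0, 75.0]

print(percent_of_control(0.42, 0.56))            # a single hypothetical reading
print(estimate_i50(concs_um, sensitive_line))    # crosses 50% within the range
print(estimate_i50(concs_um, resistant_line))    # None: never reaches 50%
```

When the resistant extract never reaches 50% inhibition at the highest concentration tested, only a lower bound on its I50 can be stated, which is consistent with describing the difference as a fold increase relative to the sensitive line.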

3. Cloning of XI12 AHAS Genes 
Seeds of the XI12 mutant derived from an imidazolinone resistant corn tissue culture line are planted; 
plants obtained therefrom appear to be segregating for the mutant AHAS phenotype. In order to obtain 
homozygous resistant seed material, a population of XI12 mutant plants is selfed. After selecting for 
herbicide resistance for three consecutive growing seasons, the seeds are homozygous for the mutant 
AHAS gene. Homozygous seeds are collected and used to grow seedlings to be used in AHAS gene 
isolation. 

DNA is extracted from 7-day-old etiolated seedlings of a homozygous XI12 line. 60 g of plant tissue is 
powdered in liquid nitrogen, and transferred into 108 ml DNA extraction buffer (1.4 M NaCl, 2.0% CTAB 
(hexadecyl trimethyl ammonium bromide), 100 mM Tris-Cl pH 8.0, 20 mM EDTA, 2% mercaptoethanol) and 
54 ml water. After incubation at 50-60°C for 30 minutes the suspension is extracted with an equal amount of 
chloroform. The DNA is precipitated by adding an equal amount of precipitation buffer (1% CTAB, 50 mM 
Tris-Cl pH 8.0, 10 mM EDTA). To purify the genomic DNA, a high speed centrifugation in 6.6 M CsCl and 
ethidium bromide is performed (Ti80 rotor, 50,000 rpm, 20°C, 24 hours). The purified DNA is extracted with 
salt saturated butanol and dialyzed for 25 hours against 3 changes of 1 l dialysis buffer (10 mM Tris-Cl pH 
8.0, 1 mM EDTA, 0.1 M NaCl). The concentration of the XI12 genomic DNA is determined 
spectrophotometrically to be 310 mg/ml. The yield is 0.93 mg. 

The XI12 genomic DNA is used to create a genomic library in the phage EMBL-3. The DNA is partially 
digested with MboI and the fragments are separated on a sucrose gradient to produce a size range between 
8 and 22 kb before cloning into the BamHI site of EMBL-3. After obtaining 2.1 x 10* independent clones, the 
library is amplified once. The titer of the library is determined to be 9 x 10^10 pfu/ml. 
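
The number of independent clones a genomic library needs can be estimated with the Clarke and Carbon relation N = ln(1 - P) / ln(1 - f), where f is the insert size as a fraction of the genome and P is the desired probability of recovering a given sequence. The sketch below applies it under stated assumptions: a maize haploid genome of roughly 2.5 x 10^9 bp (an assumed figure, not given in the text) and a mean insert of 15 kb, the midpoint of the 8 to 22 kb range mentioned above.

```python
import math


def clones_needed(genome_bp, mean_insert_bp, probability):
    """Clarke-Carbon estimate of independent clones for a given coverage probability."""
    fraction = mean_insert_bp / genome_bp
    return math.ceil(math.log(1.0 - probability) / math.log1p(-fraction))


MAIZE_GENOME_BP = 2.5e9   # assumption: approximate haploid genome size
MEAN_INSERT_BP = 15_000   # assumption: midpoint of the 8-22 kb size range

n_99 = clones_needed(MAIZE_GENOME_BP, MEAN_INSERT_BP, 0.99)
n_95 = clones_needed(MAIZE_GENOME_BP, MEAN_INSERT_BP, 0.95)

print(f"clones for 99% probability: {n_99:,}")
print(f"clones for 95% probability: {n_95:,}")
```

The result is sensitive to both assumed inputs, so it should be read as an order-of-magnitude guide to library size rather than a requirement stated by the specification.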

To obtain probes for analysis of the XI12 library, a W22 (wild-type) cDNA library in lambda gt11, 
purchased from Clontech Inc., CA, is screened with an 800 nt BamHI probe isolated from an Arabidopsis 
AHAS genomic clone. The phages are plated in a density of 50,000 pfu/15 cm plate, transferred onto 
nitrocellulose filters, prehybridized in 6x SSC, 0.2% SDS for 2 hours and hybridized with the Arabidopsis 
AHAS probe in 6x SSC, 0.2% SDS overnight. One positive phage is identified, further purified and used for 
subcloning of a 1.1 kb EcoRI fragment. The 1.1 kb EcoRI fragment is subcloned into pGemA-4 and used as 
a probe to identify the XI12 AHAS genes. 

The XI12 genomic library is plated on 12 15-cm plates (concentration of 50,000 pfu/plate) and is 
screened with the W22 AHAS cDNA probe. The filters are prehybridized (2 hours) and hybridized 
(overnight) in Church buffer (0.5 M Na phosphate, 1 mM EDTA, 1% BSA, 7% SDS) at 65°C and washed at 65°C 
in 2x SSC, 0.2% SDS and 0.3x SSC, 0.2% SDS. 12 positive plaques are obtained from a total of 7.5 x 10^5 
pfu screened and 5 positive clones are further purified and isolated according to Chisholm (BioTechniques 
7:21-23, 1989). Southern analysis (see Figure 2) shows that the phage clones represent two types of 
AHAS clones: Type-1 clones contain one large XbaI fragment (>6.5 kb) hybridizing to the AHAS cDNA 
probe; Type-2 clones contain two XbaI fragments, of 2.7 and 3.7 kb, hybridizing to the AHAS cDNA probe. 
Genomic Southern analysis of XI12 DNA demonstrates that the XbaI fragments obtained by digesting genomic DNA 
and hybridizing to the AHAS cDNA probe correspond to the XbaI fragments identified in the XI12 phage 
clones (see Figure 3). Restriction digests and Southern analysis also demonstrate that of the 5 AHAS 
clones, one clone represents the mutant als2 gene and four represent the als1 gene. 

The AHAS genes corresponding to the mutant locus located on chromosome 5 (clone 12/8A) and the 
non-mutant locus located on chromosome 4 (clone 12/17A) are subcloned as a PstI fragment (clone 12/8A) 
or as an XbaI fragment (12/17A) into the sequencing vector pBluescript II KS M13(+) (pKS+; Stratagene). Both 
2.7 kb and 3.7 kb XbaI fragments from phage 12/17A contain one complete copy of AHAS genes which are 
identified. The sequence of each is obtained by dideoxy sequencing (Pharmacia T7 sequencing kits) using 
primers complementary to the AHAS coding sequence. 

The methods of DNA extraction, cloning of the genomic library and screening of the library are as
described for the XI12 genomic DNA. The B73 AHAS genes are subcloned into the sequencing vector
pKS+ as XbaI fragments and are sequenced. The sequence is obtained by dideoxy sequencing, using
primers complementary to the AHAS coding sequence as described for the XI12 AHAS genes.

A W22 genomic library in EMBL3 purchased from Clontech Inc., CA is screened. The phages are
plated at a density of 50,000 pfu per 7-inch plate, transferred onto nitrocellulose filters, and hybridized
with the W22 AHAS cDNA probe described above (prehybridization and hybridization conditions: 6x SSC,
0.5% SDS, 1x Denhardt's, 100 mg/ml calf thymus DNA, 65°C; washing conditions: 3x SSC, 0.2% SDS for 2
hours at 65°C, and 0.3x SSC, 0.2% SDS for 2 hours). Two positive phages (12/1A and 12/4-4) are identified
and further purified.
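The washes in these protocols differ mainly in salt concentration. Assuming the conventional recipe in which 1x SSC is 0.15 M NaCl and 0.015 M trisodium citrate (a standard laboratory definition, not stated in this document), the composition of each SSC dilution named above can be sketched as follows. The function name is hypothetical.

```python
# Standard 1x SSC recipe (ASSUMED; the document gives only the fold-concentrations).
NACL_M_PER_X = 0.15      # mol/L NaCl in 1x SSC
CITRATE_M_PER_X = 0.015  # mol/L trisodium citrate in 1x SSC


def ssc_composition(fold: float) -> dict:
    """Return molar NaCl, citrate and total Na+ for an SSC solution of given fold."""
    nacl = fold * NACL_M_PER_X
    citrate = fold * CITRATE_M_PER_X
    # Each trisodium citrate contributes three sodium ions.
    return {"NaCl_M": nacl, "citrate_M": citrate, "Na_M": nacl + 3 * citrate}


if __name__ == "__main__":
    # Fold-concentrations that appear in the hybridization and wash steps.
    for fold in (6.0, 3.0, 2.0, 0.3):
        comp = ssc_composition(fold)
        print(f"{fold}x SSC: NaCl {comp['NaCl_M']:.3f} M, Na+ {comp['Na_M']:.4f} M")
```

Lower salt at the same temperature makes the wash more stringent, which is why the 0.3x SSC step follows the 2x or 3x SSC step.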

The W22 genomic clone 12/1A is subcloned as two SalI fragments, of 0.78 kb (pGemA-4) and 3.0 kb
(pGemA-14; Promega), into the vector pGem-A2, and as a 6.5 kb XbaI fragment into the vector pKS+
(pCD200). The coding strand sequence of the W22 AHAS gene is obtained by dideoxy sequencing of
nested deletions created from subclones pGemA-14 and pGemA-4 of phage 12/1A. This sequence is used
to design oligonucleotides complementary to the AHAS non-coding strand. The sequence of the non-coding
strand is obtained by dideoxy sequencing of clone pCD200 using primers complementary to the coding
strand. Upon completing the sequencing of the W22 AHAS gene, primers for both DNA strands are
designed and used for the sequencing of the AHAS genes isolated from the XI12 and B73 genomic
libraries.
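The two-strand primer strategy described above can be sketched in a few lines. This is a minimal illustration, not the authors' primer-design procedure: the primer length, the spacing between primers and the function names are assumptions, and the example input is simply the first 36 coding bases of the listing.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def walking_primers(coding: str, length: int = 18, step: int = 200):
    """Yield (start, forward, reverse) primer candidates along a coding strand.

    forward: identical to the coding strand, so it anneals to the non-coding strand.
    reverse: reverse complement of the window, so it anneals to the coding strand.
    `length` and `step` are illustrative defaults, not values from the text.
    """
    coding = coding.upper()
    for start in range(0, len(coding) - length + 1, step):
        window = coding[start:start + length]
        yield start, window, reverse_complement(window)


if __name__ == "__main__":
    # First 36 coding bases (Met1 to Thr12) of the AHAS sequence in the listing.
    example = "ATGGCCACCGCCGCCGCCGCGTCTACCGCGCTCACT"
    for start, fwd, rev in walking_primers(example, length=12, step=12):
        print(start, fwd, rev)
```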



4. Cloning of QJ22 AHAS Genes 



The sequence of the gene encoding AHAS in the maize line QJ22, which is selectively resistant to
imidazolinones, is also determined. A genomic library of QJ22 is prepared in an EMBL3 vector. A library of
800,000 phage is screened with an 850 nucleotide SalI/ClaI fragment isolated from an AHAS clone (A-4)
derived from the wild-type maize line W22. Five positive phages are picked and submitted to a second
round of screening to partially purify the phage. The partially purified phage are analyzed by PCR to
determine if any clones represent the QJ22 als1 gene. Such clones are identified as a 3.7 kb XbaI fragment
with a gene-specific SmaI site at position 495. The second screen indicates the presence of a single
positive clone with these characteristics.

The PCR product is purified using a commercial kit (Magic PCR Preps) from Promega, and the purified
DNA is sequenced with a Taq polymerase sequencing system ("fmol"), also from Promega. Sequence
analysis of both strands of the DNA of the QJ22 mutant AHAS shows a nucleotide transition from G to A in
the codon for amino acid 621. This mutation is identical to the one seen in XI12, and the remainder of the
sequence is typical of an als1 gene.
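The G to A change in codon 621 is a transition, that is, a purine-to-purine substitution. The sketch below classifies the base change between the AGT and AAT codons at position 621 of the sequence listings and looks up the two amino acids in the standard genetic code; the function names are hypothetical and only the two codons needed here are tabulated.

```python
PURINES = frozenset("AG")
PYRIMIDINES = frozenset("CT")

# Standard genetic code, restricted to the two codons discussed in the text.
AMINO_ACID = {"AGT": "Ser", "AAT": "Asn"}


def substitution_class(ref: str, alt: str) -> str:
    """Classify a single-base change as 'transition', 'transversion' or 'none'."""
    ref, alt = ref.upper(), alt.upper()
    if ref == alt:
        return "none"
    pair = {ref, alt}
    if pair <= PURINES or pair <= PYRIMIDINES:
        return "transition"
    return "transversion"


def codon_changes(wild: str, mutant: str) -> list:
    """List (position_in_codon, wild_base, mutant_base, class); positions are 1-based."""
    if len(wild) != 3 or len(mutant) != 3:
        raise ValueError("codons must be three bases long")
    return [
        (i + 1, w, m, substitution_class(w, m))
        for i, (w, m) in enumerate(zip(wild.upper(), mutant.upper()))
        if w != m
    ]


if __name__ == "__main__":
    print(codon_changes("AGT", "AAT"))
    print(AMINO_ACID["AGT"], "->", AMINO_ACID["AAT"])
```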

RESULTS 



The sequence of the mutant AHAS genes is compared with the sequences obtained from the wild-type
corn lines B73 and W22 (see Figure 7). The XI12 mutant gene (XI12/8A), the QJ22 mutant gene and
the wild-type gene are identical except for a single nucleotide transition from AGT to AAT, which causes
the amino acid change at position 621 (see Figure 8). The XI12 mutant XI12/8A and the wild-type B73/7-4
gene show an additional difference at position 63. On the other hand, the non-mutant XI12 AHAS gene
cloned in XI12/17A is completely homologous to the corresponding B73/10-2 in the region coding for the
mature AHAS protein (data not shown).
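A comparison of this kind can be reproduced codon by codon. The sketch below is an illustration rather than the method used to produce the figures: it builds the standard genetic code table, then reports every codon that differs between two equal-length coding sequences. The two example strings are the rows covering residues 613 to 624 of Sequence ID Nos. 1 and 2; the function name is hypothetical.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def codon_differences(reference: str, variant: str, first_codon: int = 1) -> list:
    """Return (codon_number, ref_codon, ref_aa, var_codon, var_aa) for each mismatch."""
    ref = "".join(reference.split()).upper()
    var = "".join(variant.split()).upper()
    if len(ref) != len(var) or len(ref) % 3:
        raise ValueError("sequences must have equal length, a multiple of three")
    diffs = []
    for offset in range(0, len(ref), 3):
        r, v = ref[offset:offset + 3], var[offset:offset + 3]
        if r != v:
            diffs.append((first_codon + offset // 3, r, CODON_TABLE[r], v, CODON_TABLE[v]))
    return diffs


# Residues 613-624 as printed in the sequence listings.
WILD_613_624 = "GAG CAT GTG TTG CCT ATG ATC CCT AGT GGT GGG GCT"    # Sequence ID No. 2
MUTANT_613_624 = "GAG CAT GTG TTG CCT ATG ATC CCT AAT GGT GGG GCT"  # Sequence ID No. 1

if __name__ == "__main__":
    print(codon_differences(WILD_613_624, MUTANT_613_624, first_codon=613))
```

Passing `first_codon` keeps the reported positions in the numbering of the listing when only a window of the gene is compared.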

DEPOSIT OF BIOLOGICAL MATERIALS 

The following biological materials were deposited with the American Type Culture Collection, 12301
Parklawn Drive, Rockville, Maryland 20857, as follows:

E. coli XL1 Blue harboring plasmid pX12/8A, deposited on July 3, 1991, Accession Number ATCC
68643

XI12 corn seed, deposited on July 16, 1991, Accession Number ATCC 75051.



Sequence Listings

Sequence ID No.: 1

Sequence Type: Nucleotide and Amino Acid
Sequence Length: 1969 BP'S and 638 Amino Acids
Strandedness: Single
Topology: Linear

Original Source Organism: Zea mays
Properties: Herbicide Resistant AHAS Enzyme

AACCCTCGCG CCGCCTCCGA GACAGCCGCC GCAACC

ATG GCC ACC GCC GCC GCC GCG TCT ACC GCG CTC ACT
Met Ala Thr Ala Ala Ala Ala Ser Thr Ala Leu Thr
1 5 10

GGC GCC ACT ACC GCT GCG CCC AAG GCG AGG CGC CGG 108
Gly Ala Thr Thr Ala Ala Pro Lys Ala Arg Arg Arg
15 20

GCG CAC CTC CTG GCC ACC CGC CGC GCC CTC GCC GCG 144
Ala His Leu Leu Ala Thr Arg Arg Ala Leu Ala Ala
25 30 35

CCC ATC AGG TGC TCA GCG GCG TCA CCC GCC ATG CCG 180
Pro Ile Arg Cys Ser Ala Ala Ser Pro Ala Met Pro
40 45

ATG GCT CCC CCG GCC ACC CCG CTC CGG CCG TGG GGC
Met Ala Pro Pro Ala Thr Pro Leu Arg Pro Trp Gly
50 55 60

CCC ACC GAT CCC CGC AAG GGC GCC GAC ATC CTC GTC
Pro Thr Asp Pro Arg Lys Gly Ala Asp Ile Leu Val
65 70

GAG TCC CTC GAG CGC TGC GGC GTC CGC GAC GTC TTC
Glu Ser Leu Glu Arg Cys Gly Val Arg Asp Val Phe
75 80

GCC TAC CCC GGC GGC GCG TCC ATG GAG ATC CAC CAG
Ala Tyr Pro Gly Gly Ala Ser Met Glu Ile His Gln
85 90 95

GCA CTC ACC CGC TCC CCC GTC ATC GCC AAC CAC CTC
Ala Leu Thr Arg Ser Pro Val Ile Ala Asn His Leu
100 105

TTC CGC CAC GAG CAA GGG GAG GCC TTT GCG GCC TCC
Phe Arg His Glu Gln Gly Glu Ala Phe Ala Ala Ser
110 115 120

GGC TAC GCG CGC TCC TCG GGC CGC GTC GGC GTC TGC
Gly Tyr Ala Arg Ser Ser Gly Arg Val Gly Val Cys
125 130

ATC GCC ACC TCC GGC CCC GGC GCC ACC AAC CTT GTC
Ile Ala Thr Ser Gly Pro Gly Ala Thr Asn Leu Val
135 140

TCC GCG CTC GCC GAC GCG CTG CTC GAT TCC GTC CCC
Ser Ala Leu Ala Asp Ala Leu Leu Asp Ser Val Pro
145 150 155

ATG GTC GCC ATC ACG GGA CAG GTG CCG CGA CGC ATG
Met Val Ala Ile Thr Gly Gln Val Pro Arg Arg Met
160 165

ATT GGC ACC GAC GCC TTC CAG GAG ACG CCC ATC GTC
Ile Gly Thr Asp Ala Phe Gln Glu Thr Pro Ile Val
170 175 180

GAG GTC ACC CGC TCC ATC ACC AAG CAC AAC TAC CTG
Glu Val Thr Arg Ser Ile Thr Lys His Asn Tyr Leu
185 190

GTC CTC GAC GTC GAC GAC ATC CCC CGC GTC GTG CAG
Val Leu Asp Val Asp Asp Ile Pro Arg Val Val Gln
195 200

GAG GCT TTC TTC CTC GCC TCC TCT GGT CGA CCG GGG
Glu Ala Phe Phe Leu Ala Ser Ser Gly Arg Pro Gly
205 210 215

CCG GTG CTT GTC GAC ATC CCC AAG GAC ATC CAG CAG
Pro Val Leu Val Asp Ile Pro Lys Asp Ile Gln Gln
220 225

CAG ATG GCG GTG CCT GTC TGG GAC AAG CCC ATG AGT
Gln Met Ala Val Pro Val Trp Asp Lys Pro Met Ser
230 235 240

CTG CCT GGG TAC ATT GCG CGC CTT CCC AAG CCC CCT
Leu Pro Gly Tyr Ile Ala Arg Leu Pro Lys Pro Pro
245 250

GCG ACT GAG TTG CTT GAG CAG GTG CTG CGT CTT GTT
Ala Thr Glu Leu Leu Glu Gln Val Leu Arg Leu Val
255 260

GGT GAA TCC CGG CGC CCT GTT CTT TAT GTT GGC GGT
Gly Glu Ser Arg Arg Pro Val Leu Tyr Val Gly Gly
265 270 275

GCG TGC GCA GCA TCT GGT GAG GAG TTG CGA CGC TTT
Ala Cys Ala Ala Ser Gly Glu Glu Leu Arg Arg Phe
280 285

GTG GAG CTG ACT GGA ATC CCG GTC ACA ACT ACT CTT
Val Glu Leu Thr Gly Ile Pro Val Thr Thr Thr Leu
290 295 300

ATG GGC CTC GGC AAC TTC CCC AGC GAC GAC CCA CTG
Met Gly Leu Gly Asn Phe Pro Ser Asp Asp Pro Leu
305 310

TCT CTG CGC ATG CTA GGT ATG CAT GGC ACG GTG TAT
Ser Leu Arg Met Leu Gly Met His Gly Thr Val Tyr
315 320

GCA AAT TAT GCA GTG GAT AAG GCC GAT CTG TTG CTT
Ala Asn Tyr Ala Val Asp Lys Ala Asp Leu Leu Leu
325 330 335

GCA CTT GGT GTG CGG TTT GAT GAT CGT GTG ACA GGG
Ala Leu Gly Val Arg Phe Asp Asp Arg Val Thr Gly
340 345

AAG ATT GAG GCT TTT GCA AGC AGG GCT AAG ATT GTG
Lys Ile Glu Ala Phe Ala Ser Arg Ala Lys Ile Val
350 355 360

CAC GTT GAT ATT GAT CCG GCT GAG ATT GGC AAG AAC
His Val Asp Ile Asp Pro Ala Glu Ile Gly Lys Asn
365 370

AAG CAG CCA CAT GTG TCC ATC TGT GCA GAT GTT AAG
Lys Gln Pro His Val Ser Ile Cys Ala Asp Val Lys
375 380

CTT GCT TTG CAG GGC ATG AAT GCT CTT CTT GAA GGA
Leu Ala Leu Gln Gly Met Asn Ala Leu Leu Glu Gly
385 390 395

AGC ACA TCA AAG AAG AGC TTT GAC TTT GGC TCA TGG
Ser Thr Ser Lys Lys Ser Phe Asp Phe Gly Ser Trp
400 405

AAC GAT GAG TTG GAT CAG CAG AAG AGG GAA TTC CCC
Asn Asp Glu Leu Asp Gln Gln Lys Arg Glu Phe Pro
410 415 420

CTT GGG TAT AAA ACA TCT AAT GAG GAG ATC CAG CCA
Leu Gly Tyr Lys Thr Ser Asn Glu Glu Ile Gln Pro
425 430

CAA TAT GCT ATT CAG GTT CTT GAT GAG CTG ACG AAA
Gln Tyr Ala Ile Gln Val Leu Asp Glu Leu Thr Lys
435 440

GGC GAG GCC ATC ATC GGC ACA GGT GTT GGG CAG CAC
Gly Glu Ala Ile Ile Gly Thr Gly Val Gly Gln His
445 450 455

CAT ATG TGG GCG GCA CAG TAC TAC ACT TAC AAG CGG
Gln Met Trp Ala Ala Gln Tyr Tyr Thr Tyr Lys Arg
460 465

CCA AGG CAG TGG TTG TCT TCA GCT GGT CTT GGG GCT
Pro Arg Gln Trp Leu Ser Ser Ala Gly Leu Gly Ala
470 475 480

ATG GGA TTT GGT TTG CCG GCT GCT GCT GGT GCT TCT
Met Gly Phe Gly Leu Pro Ala Ala Ala Gly Ala Ser
485 490

GTG GCC AAC CCA GGT GTT ACT GTT GTT GAC ATC GAT
Val Ala Asn Pro Gly Val Thr Val Val Asp Ile Asp
495 500

GGA GAT GGT AGC TTT CTC ATG AAC GTT CAG GAG CTA
Gly Asp Gly Ser Phe Leu Met Asn Val Gln Glu Leu
505 510 515

GCT ATG ATC CGA ATT GAG AAC CTC CCG GTG AAG GTC
Ala Met Ile Arg Ile Glu Asn Leu Pro Val Lys Val
520 525

TTT GTG CTA AAC AAC CAG CAC CTG GGG ATG GTG GTG
Phe Val Leu Asn Asn Gln His Leu Gly Met Val Val
530 535 540

CAG TGG GAG GAC AGG TTC TAT AAG GCC AAC AGA GCG
Gln Trp Glu Asp Arg Phe Tyr Lys Ala Asn Arg Ala
545 550

CAC ACA TAC TTG GGA AAC CCA GAG AAT GAA AGT GAG
His Thr Tyr Leu Gly Asn Pro Glu Asn Glu Ser Glu
555 560

ATA TAT CCA GAT TTC GTG ACG ATC GCC AAA GGG TTC
Ile Tyr Pro Asp Phe Val Thr Ile Ala Lys Gly Phe
565 570 575

AAC ATT CCA GCG GTC CGT GTG ACA AAG AAG AAC GAA
Asn Ile Pro Ala Val Arg Val Thr Lys Lys Asn Glu
580 585

GTC CGC GCA GCG ATA AAG AAG ATG CTC GAG ACT CCA
Val Arg Ala Ala Ile Lys Lys Met Leu Glu Thr Pro
590 595 600

GGG CCG TAC CTC TTG GAT ATA ATC GTC CCA CAC CAG
Gly Pro Tyr Leu Leu Asp Ile Ile Val Pro His Gln
605 610

GAG CAT GTG TTG CCT ATG ATC CCT AAT GGT GGG GCT
Glu His Val Leu Pro Met Ile Pro Asn Gly Gly Ala
615 620

TTC AAG GAT ATG ATC CTG GAT GGT GAT GGC AGG ACT
Phe Lys Asp Met Ile Leu Asp Gly Asp Gly Arg Thr
625 630 635

GTG TAC
Val Tyr
638

TGATCTAAAA TCCAGCAAG
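The stated sequence length can be checked against the layout of the listing: an untranslated leader, 638 codons, and a trailer that begins with the stop codon. The short sketch below performs that arithmetic and also reproduces the running nucleotide counts printed at the right of some rows, assuming 12 codons per printed row as in this listing. The constant and function names are hypothetical.

```python
# Leader and trailer as printed in the listing (spaces are only layout).
LEADER = "AACCCTCGCG CCGCCTCCGA GACAGCCGCC GCAACC".replace(" ", "")
TRAILER = "TGATCTAAAA TCCAGCAAG".replace(" ", "")

CODONS = 638         # amino acids stated in the header
CODONS_PER_ROW = 12  # layout of the printed listing


def total_length() -> int:
    """Leader + coding region + trailer, in base pairs."""
    return len(LEADER) + 3 * CODONS + len(TRAILER)


def row_end_position(row: int) -> int:
    """Nucleotide number of the last base in the given coding row (1-based)."""
    return len(LEADER) + 3 * CODONS_PER_ROW * row


if __name__ == "__main__":
    print(len(LEADER), 3 * CODONS, len(TRAILER), total_length())
    print([row_end_position(r) for r in (2, 3, 4)])
    print("trailer starts with stop codon:", TRAILER[:3])
```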



Sequence ID No.: 2

Sequence Type: Nucleotide and Amino Acid
Sequence Length: 1969 BP'S and 638 Amino Acids
Strandedness: Single
Topology: Linear
Original Source Organism: Zea mays
Properties: Herbicide Sensitive AHAS Enzyme

AACCCTCGCG CCGCCTCCGA GACAGCCGCC GCAACC

ATG GCC ACC GCC GCC GCC GCG TCT ACC GCG CTC ACT
Met Ala Thr Ala Ala Ala Ala Ser Thr Ala Leu Thr
1 5 10

GGC GCC ACT ACC GCT GCG CCC AAG GCG AGG CGC CGG
Gly Ala Thr Thr Ala Ala Pro Lys Ala Arg Arg Arg
15 20

GCG CAC CTC CTG GCC ACC CGC CGC GCC CTC GCC GCG 144
Ala His Leu Leu Ala Thr Arg Arg Ala Leu Ala Ala
25 30 35

CCC ATC AGG TGC TCA GCG GCG TCA CCC GCC ATG CCG 180
Pro Ile Arg Cys Ser Ala Ala Ser Pro Ala Met Pro
40 45

ATG GCT CCC CCG GCC ACC CCG CTC CGG CCG TGG GGC
Met Ala Pro Pro Ala Thr Pro Leu Arg Pro Trp Gly
50 55 60

CCC ACC GAT CCC CGC AAG GGC GCC GAC ATC CTC GTC
Pro Thr Asp Pro Arg Lys Gly Ala Asp Ile Leu Val
65 70

GAG TCC CTC GAG CGC TGC GGC GTC CGC GAC GTC TTC
Glu Ser Leu Glu Arg Cys Gly Val Arg Asp Val Phe
75 80

GCC TAC CCC GGC GGC GCG TCC ATG GAG ATC CAC CAG
Ala Tyr Pro Gly Gly Ala Ser Met Glu Ile His Gln
85 90 95

GCA CTC ACC CGC TCC CCC GTC ATC GCC AAC CAC CTC
Ala Leu Thr Arg Ser Pro Val Ile Ala Asn His Leu
100 105

TTC CGC CAC GAG CAA GGG GAG GCC TTT GCG GCC TCC
Phe Arg His Glu Gln Gly Glu Ala Phe Ala Ala Ser
110 115 120

GGC TAC GCG CGC TCC TCG GGC CGC GTC GGC GTC TGC
Gly Tyr Ala Arg Ser Ser Gly Arg Val Gly Val Cys
125 130

ATC GCC ACC TCC GGC CCC GGC GCC ACC AAC CTT GTC
Ile Ala Thr Ser Gly Pro Gly Ala Thr Asn Leu Val
135 140

TCC GCG CTC GCC GAC GCG CTG CTC GAT TCC GTC CCC
Ser Ala Leu Ala Asp Ala Leu Leu Asp Ser Val Pro
145 150 155

ATG GTC GCC ATC ACG GGA CAG GTG CCG CGA CGC ATG
Met Val Ala Ile Thr Gly Gln Val Pro Arg Arg Met
160 165

ATT GGC ACC GAC GCC TTC CAG GAG ACG CCC ATC GTC
Ile Gly Thr Asp Ala Phe Gln Glu Thr Pro Ile Val
170 175 180

GAG GTC ACC CGC TCC ATC ACC AAG CAC AAC TAC CTG
Glu Val Thr Arg Ser Ile Thr Lys His Asn Tyr Leu
185 190

GTC CTC GAC GTC GAC GAC ATC CCC CGC GTC GTG CAG
Val Leu Asp Val Asp Asp Ile Pro Arg Val Val Gln
195 200

GAG GCT TTC TTC CTC GCC TCC TCT GGT CGA CCG GGG
Glu Ala Phe Phe Leu Ala Ser Ser Gly Arg Pro Gly
205 210 215

CCG GTG CTT GTC GAC ATC CCC AAG GAC ATC CAG CAG
Pro Val Leu Val Asp Ile Pro Lys Asp Ile Gln Gln
220 225

CAG ATG GCG GTG CCT GTC TGG GAC AAG CCC ATG AGT
Gln Met Ala Val Pro Val Trp Asp Lys Pro Met Ser
230 235 240

CTG CCT GGG TAC ATT GCG CGC CTT CCC AAG CCC CCT
Leu Pro Gly Tyr Ile Ala Arg Leu Pro Lys Pro Pro
245 250

GCG ACT GAG TTG CTT GAG CAG GTG CTG CGT CTT GTT
Ala Thr Glu Leu Leu Glu Gln Val Leu Arg Leu Val
255 260

GGT GAA TCC CGG CGC CCT GTT CTT TAT GTT GGC GGT
Gly Glu Ser Arg Arg Pro Val Leu Tyr Val Gly Gly
265 270 275

GCG TGC GCA GCA TCT GGT GAG GAG TTG CGA CGC TTT
Ala Cys Ala Ala Ser Gly Glu Glu Leu Arg Arg Phe
280 285

GTG GAG CTG ACT GGA ATC CCG GTC ACA ACT ACT CTT
Val Glu Leu Thr Gly Ile Pro Val Thr Thr Thr Leu
290 295 300

ATG GGC CTC GGC AAC TTC CCC AGC GAC GAC CCA CTG
Met Gly Leu Gly Asn Phe Pro Ser Asp Asp Pro Leu
305 310

TCT CTG CGC ATG CTA GGT ATG CAT GGC ACG GTG TAT
Ser Leu Arg Met Leu Gly Met His Gly Thr Val Tyr
315 320

GCA AAT TAT GCA GTG GAT AAG GCC GAT CTG TTG CTT
Ala Asn Tyr Ala Val Asp Lys Ala Asp Leu Leu Leu
325 330 335

GCA CTT GGT GTG CGG TTT GAT GAT CGT GTG ACA GGG
Ala Leu Gly Val Arg Phe Asp Asp Arg Val Thr Gly
340 345

AAG ATT GAG GCT TTT GCA AGC AGG GCT AAG ATT GTG
Lys Ile Glu Ala Phe Ala Ser Arg Ala Lys Ile Val
350 355 360

CAC GTT GAT ATT GAT CCG GCT GAG ATT GGC AAG AAC
His Val Asp Ile Asp Pro Ala Glu Ile Gly Lys Asn
365 370

AAG CAG CCA CAT GTG TCC ATC TGT GCA GAT GTT AAG
Lys Gln Pro His Val Ser Ile Cys Ala Asp Val Lys
375 380

CTT GCT TTG CAG GGC ATG AAT GCT CTT CTT GAA GGA
Leu Ala Leu Gln Gly Met Asn Ala Leu Leu Glu Gly
385 390 395

AGC ACA TCA AAG AAG AGC TTT GAC TTT GGC TCA TGG
Ser Thr Ser Lys Lys Ser Phe Asp Phe Gly Ser Trp
400 405

AAC GAT GAG TTG GAT CAG CAG AAG AGG GAA TTC CCC
Asn Asp Glu Leu Asp Gln Gln Lys Arg Glu Phe Pro
410 415 420

CTT GGG TAT AAA ACA TCT AAT GAG GAG ATC CAG CCA
Leu Gly Tyr Lys Thr Ser Asn Glu Glu Ile Gln Pro
425 430

CAA TAT GCT ATT CAG GTT CTT GAT GAG CTG ACG AAA
Gln Tyr Ala Ile Gln Val Leu Asp Glu Leu Thr Lys
435 440

GGC GAG GCC ATC ATC GGC ACA GGT GTT GGG CAG CAC
Gly Glu Ala Ile Ile Gly Thr Gly Val Gly Gln His
445 450 455

CAT ATG TGG GCG GCA CAG TAC TAC ACT TAC AAG CGG
Gln Met Trp Ala Ala Gln Tyr Tyr Thr Tyr Lys Arg
460 465

CCA AGG CAG TGG TTG TCT TCA GCT GGT CTT GGG GCT
Pro Arg Gln Trp Leu Ser Ser Ala Gly Leu Gly Ala
470 475 480

ATG GGA TTT GGT TTG CCG GCT GCT GCT GGT GCT TCT
Met Gly Phe Gly Leu Pro Ala Ala Ala Gly Ala Ser
485 490

GTG GCC AAC CCA GGT GTT ACT GTT GTT GAC ATC GAT
Val Ala Asn Pro Gly Val Thr Val Val Asp Ile Asp
495 500

GGA GAT GGT AGC TTT CTC ATG AAC GTT CAG GAG CTA
Gly Asp Gly Ser Phe Leu Met Asn Val Gln Glu Leu
505 510 515

GCT ATG ATC CGA ATT GAG AAC CTC CCG GTG AAG GTC
Ala Met Ile Arg Ile Glu Asn Leu Pro Val Lys Val
520 525

TTT GTG CTA AAC AAC CAG CAC CTG GGG ATG GTG GTG
Phe Val Leu Asn Asn Gln His Leu Gly Met Val Val
530 535 540

CAG TGG GAG GAC AGG TTC TAT AAG GCC AAC AGA GCG
Gln Trp Glu Asp Arg Phe Tyr Lys Ala Asn Arg Ala
545 550

CAC ACA TAC TTG GGA AAC CCA GAG AAT GAA AGT GAG
His Thr Tyr Leu Gly Asn Pro Glu Asn Glu Ser Glu
555 560

ATA TAT CCA GAT TTC GTG ACG ATC GCC AAA GGG TTC
Ile Tyr Pro Asp Phe Val Thr Ile Ala Lys Gly Phe
565 570 575

AAC ATT CCA GCG GTC CGT GTG ACA AAG AAG AAC GAA
Asn Ile Pro Ala Val Arg Val Thr Lys Lys Asn Glu
580 585

GTC CGC GCA GCG ATA AAG AAG ATG CTC GAG ACT CCA
Val Arg Ala Ala Ile Lys Lys Met Leu Glu Thr Pro
590 595 600

GGG CCG TAC CTC TTG GAT ATA ATC GTC CCA CAC CAG
Gly Pro Tyr Leu Leu Asp Ile Ile Val Pro His Gln
605 610

GAG CAT GTG TTG CCT ATG ATC CCT AGT GGT GGG GCT
Glu His Val Leu Pro Met Ile Pro Ser Gly Gly Ala
615 620

TTC AAG GAT ATG ATC CTG GAT GGT GAT GGC AGG ACT
Phe Lys Asp Met Ile Leu Asp Gly Asp Gly Arg Thr
625 630 635

GTG TAC
Val Tyr
638

TGATCTAAAA TCCAGCAAG



Sequence ID No.: 3

Sequence Type: Nucleotide and Amino Acid
Sequence Length: 1969 BP'S and 638 Amino Acids
Strandedness: Single
Topology: Linear

Original Source Organism: Zea mays
Properties: Herbicide Sensitive AHAS Enzyme

AACCCTCGCG CCGCCTCCGA GACAGCCGCC GCAACC

ATG GCC ACC GCC GCC GCC GCG TCT ACC GCG CTC ACT
Met Ala Thr Ala Ala Ala Ala Ser Thr Ala Leu Thr
1 5 10

GGC GCC ACT ACC GCT GCG CCC AAG GCG AGG CGC CGG
Gly Ala Thr Thr Ala Ala Pro Lys Ala Arg Arg Arg
15 20

GCG CAC CTC CTG GCC ACC CGC CGC GCC CTC GCC GCG
Ala His Leu Leu Ala Thr Arg Arg Ala Leu Ala Ala
25 30 35

CCC ATC AGG TGC TCA GCG GCG TCA CCC GCC ATG CCG
Pro Ile Arg Cys Ser Ala Ala Ser Pro Ala Met Pro
40 45

ATG GCT CCC CCG GCC ACC CCG CTC CGG CCG TGG GGC
Met Ala Pro Pro Ala Thr Pro Leu Arg Pro Trp Gly
50 55 60

CCC ACC GAG CCC CGC AAG GGT GCT GAC ATC CTC GTC
Pro Thr Glu Pro Arg Lys Gly Ala Asp Ile Leu Val
65 70

GAG TCC CTC GAG CGC TGC GGC GTC CGC GAC GTC TTC
Glu Ser Leu Glu Arg Cys Gly Val Arg Asp Val Phe
75 80

GCC TAC CCC GGC GGC GCG TCC ATG GAG ATC CAC CAG
Ala Tyr Pro Gly Gly Ala Ser Met Glu Ile His Gln
85 90 95

GCA CTC ACC CGC TCC CCC GTC ATC GCC AAC CAC CTC
Ala Leu Thr Arg Ser Pro Val Ile Ala Asn His Leu
100 105

TTC CGC CAC GAG CAA GGG GAG GCC TTT GCC GCC TCC
Phe Arg His Glu Gln Gly Glu Ala Phe Ala Ala Ser
110 115 120

GGC TAC GCG CGC TCC TCG GGC CGC GTC GGC GTC TGC
Gly Tyr Ala Arg Ser Ser Gly Arg Val Gly Val Cys
125 130

ATC GCC ACC TCC GGC CCC GGC GCC ACC AAC CTA GTC
Ile Ala Thr Ser Gly Pro Gly Ala Thr Asn Leu Val
135 140

TCC GCG CTC GCC GAC GCG CTG CTC GAT TCC GTC CCC
Ser Ala Leu Ala Asp Ala Leu Leu Asp Ser Val Pro
145 150 155

ATG GTC GCC ATC ACG GGA CAG GTG CCG CGA CGC ATG
Met Val Ala Ile Thr Gly Gln Val Pro Arg Arg Met
160 165

ATT GGC ACC GAC GCC TTC CAG GAG ACG CCC ATC GTC
Ile Gly Thr Asp Ala Phe Gln Glu Thr Pro Ile Val
170 175 180

GAG GTC ACC CGC TCC ATC ACC AAG CAC AAC TAC CTG
Glu Val Thr Arg Ser Ile Thr Lys His Asn Tyr Leu
185 190

GTC CTC GAC GTC GAC GAC ATC CCC CGC GTC GTG CAG
Val Leu Asp Val Asp Asp Ile Pro Arg Val Val Gln
195 200

GAG GCT TTC TTC CTC GCC TCC TCT GGT CGA CCA GGG
Glu Ala Phe Phe Leu Ala Ser Ser Gly Arg Pro Gly
205 210 215

CCG GTG CTT GTC GAC ATC CCC AAG GAC ATC CAG CAG
Pro Val Leu Val Asp Ile Pro Lys Asp Ile Gln Gln
220 225

CAG ATG GCG GTG CCT GTC TGG GAC AAG CCC ATG AGT
Gln Met Ala Val Pro Val Trp Asp Lys Pro Met Ser
230 235 240

CTG CCT GGG TAC ATT GCG CGC CTT CCC AAG CCC CCT
Leu Pro Gly Tyr Ile Ala Arg Leu Pro Lys Pro Pro
245 250

GCG ACT GAG TTG CTT GAG CAG GTG CTG CGT CTT GTT
Ala Thr Glu Leu Leu Glu Gln Val Leu Arg Leu Val
255 260

GGT GAA TCG CGG CGC CCT GTT CTT TAT GTG GGC GGT
Gly Glu Ser Arg Arg Pro Val Leu Tyr Val Gly Gly
265 270 275

GCG TGC GCA GCA TCT GGT GAG GAG TTG CGA CGC TTT
Ala Cys Ala Ala Ser Gly Glu Glu Leu Arg Arg Phe
280 285

GTG GAG CTG ACT GGA ATC CCG GTC ACA ACT ACT CTT
Val Glu Leu Thr Gly Ile Pro Val Thr Thr Thr Leu
290 295 300

ATG GGC CTC GGC AAC TTC CCC AGC GAC GAC CCA CTG
Met Gly Leu Gly Asn Phe Pro Ser Asp Asp Pro Leu
305 310

TCT CTG CGC ATG CTA GGT ATG CAT GGG ACG GTG TAT
Ser Leu Arg Met Leu Gly Met His Gly Thr Val Tyr
315 320

GCA AAT TAT GCA GTG GAT AAG GCC GAT CTG TTG CTT
Ala Asn Tyr Ala Val Asp Lys Ala Asp Leu Leu Leu
325 330 335

GCA CTT GGT GTG CGG TTT GAT GAT CGT GTG ACA GGG
Ala Leu Gly Val Arg Phe Asp Asp Arg Val Thr Gly
340 345

AAG ATT GAG GCT TTT GCA AGC AGG GCT AAG ATT GTG
Lys Ile Glu Ala Phe Ala Ser Arg Ala Lys Ile Val
350 355 360

CAC GTT GAT ATT GAT CCG GCT GAG ATT GGC AAG AAC
His Val Asp Ile Asp Pro Ala Glu Ile Gly Lys Asn
365 370

AAG CAG CCA CAT GTG TCC ATC TGT GCA GAT GTT AAG
Lys Gln Pro His Val Ser Ile Cys Ala Asp Val Lys
375 380

EP 0 525 384 A2

CTT GCT TTG CAG GGC ATG AAT GCT CTT CTT GAA GGA
Leu Ala Leu Gln Gly Met Asn Ala Leu Leu Glu Gly
385 390 395

AGC ACA TCA AAG AAG AGC TTT GAC TTT GGC TCA TGG
Ser Thr Ser Lys Lys Ser Phe Asp Phe Gly Ser Trp
400 405

AAC GAT GAG TTG GAT CAG CAG AAG AGG GAA TTC CCC
Asn Asp Glu Leu Asp Gln Gln Lys Arg Glu Phe Pro
410 415 420

CTT GGG TAT AAA ACA TCT AAT GAG GAG ATC CAG CCA
Leu Gly Tyr Lys Thr Ser Asn Glu Glu Ile Gln Pro
425 430

CAA TAT GCT ATT CAG GTT CTT GAT GAG CTG ACG AAA
Gln Tyr Ala Ile Gln Val Leu Asp Glu Leu Thr Lys
435 440

GGC GAG GCC ATC ATC GGC ACA GGT GTT GGG CAG CAC
Gly Glu Ala Ile Ile Gly Thr Gly Val Gly Gln His
445 450 455

CAT ATG TGG GCG GCA CAG TAC TAC ACT TAC AAG CGG
Gln Met Trp Ala Ala Gln Tyr Tyr Thr Tyr Lys Arg
460 465

CCA AGG CAG TGG TTG TCT TCA GCT GGT CTT GGG GCT
Pro Arg Gln Trp Leu Ser Ser Ala Gly Leu Gly Ala
470 475 480

ATG GGA TTT GGT TTG CCG GCT GCT GCT GGT GCT TCT
Met Gly Phe Gly Leu Pro Ala Ala Ala Gly Ala Ser

485 490

GTG GCC AAC CCA GGT GTC ACT GTT GTT GAC ATC GAT
Val Ala Asn Pro Gly Val Thr Val Val Asp Ile Asp
495 500

GGA GAT GGT AGC TTT CTC ATG AAC GTT CAG GAG CTA
Gly Asp Gly Ser Phe Leu Met Asn Val Gln Glu Leu
505 510 515

GCT ATG ATC CGA ATT GAG AAC CTC CCA GTG AAG GTC
Ala Met Ile Arg Ile Glu Asn Leu Pro Val Lys Val
520 525

TTT GTG CTA AAC AAC CAG CAC CTG GGG ATG GTG GTG
Phe Val Leu Asn Asn Gln His Leu Gly Met Val Val
530 535 540

CAG TGG GAG GAC AGG TTC TAT AAG GCC AAC AGA GCG
Gln Trp Glu Asp Arg Phe Tyr Lys Ala Asn Arg Ala
545 550

CAC ACA TAC TTG GGA AAC CCA GAG AAT GAA AGT GAG
His Thr Tyr Leu Gly Asn Pro Glu Asn Glu Ser Glu
555 560

ATA TAT CCA GAT TTC GTG ACG ATC GCC AAA GGG TTC
Ile Tyr Pro Asp Phe Val Thr Ile Ala Lys Gly Phe
565 570 575

AAC ATT CCA GCG GTC CGT GTG ACA AAG AAG AAC GAA
Asn Ile Pro Ala Val Arg Val Thr Lys Lys Asn Glu
580 585

GTC CGC GCA GCG ATA AAG AAG ATG CTC GAG ACT CCA
Val Arg Ala Ala Ile Lys Lys Met Leu Glu Thr Pro
590 595 600

GGG CCG TAC CTC TTG GAT ATA ATC GTC CCA CAC CAG
Gly Pro Tyr Leu Leu Asp Ile Ile Val Pro His Gln
605 610

GAG CAT GTG TTG CCT ATG ATC CCT AGT GGT GGG GCT
Glu His Val Leu Pro Met Ile Pro Ser Gly Gly Ala
615 620

TTC AAG GAT ATG ATC CTG GAT GGT GAT GGC AGG ACT
Phe Lys Asp Met Ile Leu Asp Gly Asp Gly Arg Thr
625 630 635

GTG TAC
Val Tyr
638

TGATCTAAAA TCCAGCAAG



Claims 

1. A monocot nucleic acid sequence encoding a functional AHAS enzyme, which enzyme has an amino
acid substitution relative to a wild-type monocot AHAS enzyme, and which substitution confers
imidazolinone-specific resistance to the enzyme.

2. The sequence of Claim 1 in which the monocot is corn and the substitution is at position 621 in the
wild-type corn AHAS enzyme.

3. The sequence of Claim 2 in which the substituted amino acid is asparagine.

4. A functional monocot AHAS enzyme which has an amino acid substitution relative to a monocot
wild-type AHAS enzyme, and which substitution confers imidazolinone-specific resistance to the enzyme.

5. The enzyme of Claim 4 in which the monocot is corn and the substitution is at position 621 in the
wild-type corn AHAS enzyme.

6. The enzyme of Claim 5 in which the substituted amino acid is asparagine.

7. A transformation vector comprising the nucleic acid of Claim 1.

8. A host cell comprising the nucleic acid sequence of Claim 1, or the vector of Claim 7.

9. The host cell of Claim 8 which is a plant cell or a bacterial cell.

10. An imidazolinone-specific resistant mature plant containing the nucleic acid sequence of Claim 1, or
seed or pollen therefrom.

11. A method of conferring imidazolinone-specific resistance to a plant cell which comprises providing the
plant cell with the nucleic acid sequence of Claim 1.

12. A method for growing imidazolinone-specific resistant plants which comprises cultivating a plant which
produces the enzyme of Claim 4 in the presence of an inhibitory amount of imidazolinone.

13. A method of selecting host cells successfully transformed with a gene of interest which comprises
providing to prospective host cells the gene of interest linked to the nucleic acid sequence of Claim 1,
or unlinked but in the presence of the nucleic acid sequence of Claim 1, growing the cells in the
presence of an inhibitory amount of imidazolinone and identifying surviving cells as containing the gene
of interest.

14. A nucleic acid construct comprising the sequence of Claim 1 linked to a gene encoding an
agronomically useful trait.

FIG. 5A



W22/1A and B73/7-4 sequence: 5'TAGTG3'
3'ATCAC5'

XI12/8A sequence: 5'TAATG3' 
3'ATTAC5' 
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50 



70 

X 



ATG GCC ACC GCC GCC GCC GCG TCT ACC GCG CTC ACT GGC GCC ACT ACC GCT GCG 

Met Ala Thr Ala Ala Ala Ala Ser Thr Ala Leu Thr Gly Ala Thr Thr Ala Ala 

100 110 120 130 140 

I I X X X 

CCC AAG GCG AGG CGC CGG GCG CAC CTC CTG GCC ACC CGC CGC GCC CTC GCC GCG 

Pro Lys Ala Arg Arg Arg Ala His Leu Leu Ala Thr Arg Arg Ala Leu AIq Ala 



150 



160 
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.190 
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"x 
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Gly Pro 


Thr 


Asp 


Pro Arg Lys Gly Ala Asp 


He Leu 
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260 
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300 
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X 






X X 






GAG TCC CTC GAG CGC 


TGC GGC 


GTC 


CGC 


GAC GTC TTC GCC TAC CCC 


GGC GGC 


GCG 


Glu Ser Leu Glu Arg 


Cys Gly 


Val 


Arg 


Asp Yal Phe Ala Tyr Pro 


Gly Gly 


Ala 


310 320 




330 




340 350 




360 


X X 
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X X 
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TCC ATG GAG ATC CAC 


CAG GCA 
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Thr 


Arg Ser Pro Val lie Ala 


Asn His 


Leu 


370 


380 






390 400 


410 
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X 






X X 


X 




TTC CGC CAC GAG CAA 


GGG GAG 


GCC 


TTT 


GCG GCC TCC GGC TAC GCG 


CGC TCC 


TCG 


Phe Arg His Glu Gin 


Gly Glu 


Ala 


Phe 


Ala Ala Ser Gly Tyr Ala 


Arg Ser 


Ser 
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420 430 440 450 460 

X X I * x 

GGC CGC GTC GGC GTC TGC-ATC GGG ACC TCC GGC CCC GGC GCC ACC AAC CTT GTC 
Gly Ar9 Val Gly Val Cys He Ala Thr Ser Gly Pro Gly Ala Thr Asn Leu Val 

470 4B0 490 500 510 520 

XX XXI X 

TCC GCG CTC GCC GAG GCG C7G CTC GAT TCC GTC CCC ATG GTC GCC ATC ACG GGA 
Ser Ala Leu Ala Asp Ala Leu Leu Asp Ser Val Pro Met Val Ala He Thr Gly 

530 _ 540 550 560 . .._ 570 

XX XXX 

CAG 6TG CCG CGA CGC ATG ATT GGC ACC GAC GCC TTC CAG GAG ACG CCC ATC GTC 
Gin Val Pro Arg Arg Met He Gly Thr Asp Ala Phe Gin Glu Thr Pro lie Val 



580 590 600 610 620 630 





X 


X 


X 


X 
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Lys Asp He Gin 
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740 750 760 770 780 790 

X X X X X X 

TGG GAC AAG CCC ATG AGT CTG CCT GGG TAC ATT GCG CGC CTT CCC AAG CCC CCT 

Trp Asp Lys Pro Het Ser Leu Pro Gly Tyr He Ala Arg Leu Pro Lys Pro Pro 
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800 810 820 830 840 

ii iii 
GCG ACT GAG TTG CTT GAG CAG GTG CTG CGT CTT GTT GGT GAA TCC CGG CGC CCT 
Ala Thr Glu Leu Leu Glu Gin Val Leu Arg Leu Val Gly Glu Ser Arg Arg Pro 

850 860 870 880 890 900 

i i i i i i 

GTT CTT TAT GTT GGC GGT GCG TGC GCA GCA TCT GGT GAG GAG TTG CGA CGC TTT 

Val Leu Tyr Val Gly Gly Ala Cys Ala Ala Ser Gly Glu Glu Leu Arg Arg Phe 

,910 920 Mb 940 950 

iik ii 
GTG GAG CTG ACT GGA ATC CCG GTC ACA ACT ACT CTT ATG GGC CTC GGC AAC TTC 
Val Glu Leu Thr Gly He Pro Val Thr Thr Thr Leu Wei Gly Leu Gly Asn Phe 

960 970 980 990 1000 

i i i i i 

CCC AGC GAC GAC CCA CTG TCT CTG CGC ATG CTA GGT ATG CAT GGC ACG GTG TAT 

Pro Ser Asp Asp Pro Leu Ser Leu Arg Met Leu Gly Mei His Gly Thr Val Tyr 

1010 1020 1030 1040 1050 1060 

ii iii i 

GCA AAT TAT GCA GTG GAT AAG GCC GAT CTG TTG CTT GCA CTT GGT GTG CGG TTT 

Ala Asn Tyr Ala Val Asp Lys Ala Asp Leu Leu Leu Ala Leu Gly Val Arg Phe 

1070 1080 1090 1100 1110 

ii iii 
GAT GAT CGT GTG ACA GGG AAG ATT GAG GCT TTT GCA AGC AGG GCT AAG ATT GTG 
Asp Asp Arg Val Thr Gly Lys He Glu Ala Phe Ala Ser Arg Ala Lys lie Val 

1120 U30 1140 1150 1160 1170 

iii iii 
CAC GTT GAT ATT GAT CCG GCT GAG ATT GGC AAG AAC AAG CAG CCA CAT GTG TCC 
His Val Asp He Asp Pro Ala Glu He Gly Lys Asn Lys Gin Pro His Val Ser 
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i x I IX 

ATC TGT GCA GAT GTT AAG CTT GCT TTG CAG GGC ATG AAT GCT CTT CTT GAA GGA 

He Cys Ala Asp Val Lys Leu Ala Leu Gin Gly Met Asn Ala Leu Leu Glu Gly 
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10 20 30 40 50 60 

XU2/8A AACCC TCGCG CCGCC TCCGA GACAG CCGCC GCAAC CATGG CCACC GCCGC CGCCG CGTCT 

V22/1A AACCC TCGCG CCGCC TCCGA GACAG CCGCC GCAAC CATGG CCACC GCCGC CGCCG CGTCT 

inn nut mil mn urn urn mil urn urn urn inn urn 

B73/7-4 AACCC TCGCG CCGCC TCCGA GACAG CCGCC GCAAC CATGG CCACC GCCGC CGCCG CGTCT 

mu mn urn mn mn urn urn nm mil inn um urn 

XI12/8A AACCC TCGCG CCGCC TCCGA GACAG CCGCC GCAAC CATGG CCACC GCCGC CGCCG CGTCT 

70 80 90 100 110 120 

XI12/8A ACCGC .GCTCA-CJGGC GCCAC TACCG CTGCG CCCAA .GGCGA. GGCGC CGGGC GCACC TCCTG 

W22/IA ACCGC GCTCA CTGGC GCCAC TACCG CTGCG CCCAA GGCGA GGCGC CGGGC GCACC TCCTG 

mu mn mu um mn um nm um mu um mu um 

B73/7-4 ACCGC GCTCA CTGGC GCCAC TACCG CTGCG CCCAA GGCGA GGCGC CGGGC GCACC TCCTG 

mu mu mu um mu um nm um um um um um 

XU2/8A ACCGC GCTCA CTGGC GCCAC TACCG CTGCG CCCAA GGCGA 6GCGC CGGGC GCACC TCCTG 

130 140 150 160 170 180 

XU2/8A GCCAC CCGCC GCGCC CTCGC CGCGC CCATC AGGTG CTCAG CGGCG TCACC CGCCA TGCCG 

V22/1A GCCAC CCGCC GCGCC CTCGC CGCGC CCATC AGGTG CTCAG CGGCG TCACC CGCCA TGCCG 

um mu mu um um mu mu mu mu um mu um 

B73/7-4 GCCAC CCGCC GCGCC CTCGC CGCGC CCATC AGGTG CTCAG CGGCG TCACC CGCCA TGCCG 

um um um um um mu um um um mu um um 

XU2/8A GCCAC CCGCC GCGCC CTCGC CGCGC CCATC AGGTG CTCAG CGGCG TCACC CGCCA TGCCG 

190 200 210 220 230 240 

XI12/BA ATGGC TCCCC CGGCC ACCCC GCTCC GGCCG TGGGG CCCCA CCGAT CCCCG CAAGG GCGCC 

tf > > 

V22/1A ATGGC TCCCC CGGCC ACCCC GCTCC GGCCG TGGGG CCCCA CCGAT CCCCG CAAGG GCGCC 

um mu um um um um um um uu um um 1 u 

B73/7-4 ATGGC TCCCC CGGCC ACCCC GCTCC GGCCG TGGGG CCCCA CCGAg CCCCG CAAGG GtGCt 

um um um um mu mu mu um uu mu um 1 u 

X112/8A ATGGC TCCCC CGGCC ACCCC GCTCC GGCCG TGGGG CCCCA CCGAT CCCCG CAAGG GCGCC 

                 250         260         270         280         290         300
XI12/8A  GACAT CCTCG TCGAG TCCCT CGAGC GCTGC GGCGT CCGCG ACGTC TTCGC CTACC CCGGC
W22/1A   GACAT CCTCG TCGAG TCCCT CGAGC GCTGC GGCGT CCGCG ACGTC TTCGC CTACC CCGGC
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
B73/7-4  GACAT CCTCG TCGAG TCCCT CGAGC GCTGC GGCGT CCGCG ACGTC TTCGC CTACC CCGGC
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
XI12/8A  GACAT CCTCG TCGAG TCCCT CGAGC GCTGC GGCGT CCGCG ACGTC TTCGC CTACC CCGGC

                 310         320         330         340         350         360
XI12/8A  GGCGC GTCCA TGGAG ATCCA CCAGG CACTC ACCCG CTCCC CCGTC ATCGC CAACC ACCTC
W22/1A   GGCGC GTCCA TGGAG ATCCA CCAGG CACTC ACCCG CTCCC CCGTC ATCGC CAACC ACCTC
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
B73/7-4  GGCGC GTCCA TGGAG ATCCA CCAGG CACTC ACCCG CTCCC CCGTC ATCGC CAACC ACCTC
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
XI12/8A  GGCGC GTCCA TGGAG ATCCA CCAGG CACTC ACCCG CTCCC CCGTC ATCGC CAACC ACCTC

                 370         380         390         400         410         420
XI12/8A  TTCCG CCACG AGCAA GGGGA GGCCT TTGCG GCCTC CGGCT ACGCG CGCTC CTCGG GCCGC
W22/1A   TTCCG CCACG AGCAA GGGGA GGCCT TTGCG GCCTC CGGCT ACGCG CGCTC CTCGG GCCGC
         ||||| ||||| ||||| ||||| ||||| ||||  ||||| ||||| ||||| ||||| ||||| |||||
B73/7-4  TTCCG CCACG AGCAA GGGGA GGCCT TTGCc GCCTC CGGCT ACGCG CGCTC CTCGG GCCGC
         ||||| ||||| ||||| ||||| ||||| ||||  ||||| ||||| ||||| ||||| ||||| |||||
XI12/8A  TTCCG CCACG AGCAA GGGGA GGCCT TTGCG GCCTC CGGCT ACGCG CGCTC CTCGG GCCGC

                 430         440         450         460         470         480
XI12/8A  GTCGG CGTCT GCATC GCCAC CTCCG GCCCC GGCGC CACCA ACCTT GTCTC CGCGC TCGCC
W22/1A   GTCGG CGTCT GCATC GCCAC CTCCG GCCCC GGCGC CACCA ACCTT GTCTC CGCGC TCGCC
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||  ||||| ||||| |||||
B73/7-4  GTCGG CGTCT GCATC GCCAC CTCCG GCCCC GGCGC CACCA ACCTa GTCTC CGCGC TCGCC
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||  ||||| ||||| |||||
XI12/8A  GTCGG CGTCT GCATC GCCAC CTCCG GCCCC GGCGC CACCA ACCTT GTCTC CGCGC TCGCC

                 490         500         510         520         530         540
XI12/8A  GACGC GCTGC TCGAT TCCGT CCCCA TGGTC GCCAT CACGG GACAG GTGCC GCGAC GCATG
W22/1A   GACGC GCTGC TCGAT TCCGT CCCCA TGGTC GCCAT CACGG GACAG GTGCC GCGAC GCATG
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
B73/7-4  GACGC GCTGC TCGAT TCCGT CCCCA TGGTC GCCAT CACGG GACAG GTGCC GCGAC GCATG
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
XI12/8A  GACGC GCTGC TCGAT TCCGT CCCCA TGGTC GCCAT CACGG GACAG GTGCC GCGAC GCATG

                 550         560         570         580         590         600
XI12/8A  ATTGG CACCG ACGCC TTCCA GGAGA CGCCC ATCGT CGAGG TCACC CGCTC CATCA CCAAG
W22/1A   ATTGG CACCG ACGCC TTCCA GGAGA CGCCC ATCGT CGAGG TCACC CGCTC CATCA CCAAG
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
B73/7-4  ATTGG CACCG ACGCC TTCCA GGAGA CGCCC ATCGT CGAGG TCACC CGCTC CATCA CCAAG
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
XI12/8A  ATTGG CACCG ACGCC TTCCA GGAGA CGCCC ATCGT CGAGG TCACC CGCTC CATCA CCAAG

                 610         620         630         640         650         660
XI12/8A  CACAA CTACC TGGTC CTCGA CGTCG ACGAC ATCCC CCGCG TCGTG CAGGA GGCTT TCTTC
W22/1A   CACAA CTACC TGGTC CTCGA CGTCG ACGAC ATCCC CCGCG TCGTG CAGGA GGCTT TCTTC
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
B73/7-4  CACAA CTACC TGGTC CTCGA CGTCG ACGAC ATCCC CCGCG TCGTG CAGGA GGCTT TCTTC
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
XI12/8A  CACAA CTACC TGGTC CTCGA CGTCG ACGAC ATCCC CCGCG TCGTG CAGGA GGCTT TCTTC

                 670         680         690         700         710         720
XI12/8A  CTCGC CTCCT CTGGT CGACC GGGGC CGGTG CTTGT CGACA TCCCC AAGGA CATCC AGCAG
W22/1A   CTCGC CTCCT CTGGT CGACC GGGGC CGGTG CTTGT CGACA TCCCC AAGGA CATCC AGCAG
         ||||| ||||| ||||| |||||  |||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
B73/7-4  CTCGC CTCCT CTGGT CGACC oGGGC CGGTG CTTGT CGACA TCCCC AAGGA CATCC AGCAG
         ||||| ||||| ||||| |||||  |||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
XI12/8A  CTCGC CTCCT CTGGT CGACC GGGGC CGGTG CTTGT CGACA TCCCC AAGGA CATCC AGCAG

                 730         740         750         760         770         780
XI12/8A  CAGAT GGCGG TGCCT GTCTG GGACA AGCCC ATGAG TCTGC CTGGG TACAT TGCGC GCCTT
W22/1A   CAGAT GGCGG TGCCT GTCTG GGACA AGCCC ATGAG TCTGC CTGGG TACAT TGCGC GCCTT
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
B73/7-4  CAGAT GGCGG TGCCT GTCTG GGACA AGCCC ATGAG TCTGC CTGGG TACAT TGCGC GCCTT
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
XI12/8A  CAGAT GGCGG TGCCT GTCTG GGACA AGCCC ATGAG TCTGC CTGGG TACAT TGCGC GCCTT

                 790         800         810         820         830         840
XI12/8A  CCCAA GCCCC CTGCG ACTGA GTTGC TTGAG CAGGT GCTGC GTCTT GTTGG TGAAT CCCGG
W22/1A   CCCAA GCCCC CTGCG ACTGA GTTGC TTGAG CAGGT GCTGC GTCTT GTTGG TGAAT CCCGG
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| | |||
B73/7-4  CCCAA GCCCC CTGCG ACTGA GTTGC TTGAG CAGGT GCTGC GTCTT GTTGG TGAAT CgCGG
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| | |||
XI12/8A  CCCAA GCCCC CTGCG ACTGA GTTGC TTGAG CAGGT GCTGC GTCTT GTTGG TGAAT CCCGG

                 850         860         870         880         890         900
XI12/8A  CGCCC TGTTC TTTAT GTTGG CGGTG GCTGC GCAGC ATCTG GTGAG GAGTT GCGAC GCTTT
W22/1A   CGCCC TGTTC TTTAT GTTGG CGGTG GCTGC GCAGC ATCTG GTGAG GAGTT GCGAC GCTTT
         ||||| ||||| ||||| || || ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
B73/7-4  CGCCC TGTTC TTTAT GTgGG CGGTG GCTGC GCAGC ATCTG GTGAG GAGTT GCGAC GCTTT
         ||||| ||||| ||||| || || ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
XI12/8A  CGCCC TGTTC TTTAT GTTGG CGGTG GCTGC GCAGC ATCTG GTGAG GAGTT GCGAC GCTTT

                 910         920         930         940         950         960
XI12/8A  GTGGA GCTGA CTGGA ATCCC GGTCA CAACT ACTCT TATGG GCCTC GGCAA CTTCC CCAGC
W22/1A   GTGGA GCTGA CTGGA ATCCC GGTCA CAACT ACTCT TATGG GCCTC GGCAA CTTCC CCAGC
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
B73/7-4  GTGGA GCTGA CTGGA ATCCC GGTCA CAACT ACTCT TATGG GCCTC GGCAA CTTCC CCAGC
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
XI12/8A  GTGGA GCTGA CTGGA ATCCC GGTCA CAACT ACTCT TATGG GCCTC GGCAA CTTCC CCAGC

                 970         980         990        1000        1010        1020
XI12/8A  GACGA CCCAC TGTCT CTGCG CATGC TAGGT ATGCA TGGCA CGGTG TATGC AAATT ATGCA
W22/1A   GACGA CCCAC TGTCT CTGCG CATGC TAGGT ATGCA TGGCA CGGTG TATGC AAATT ATGCA
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||| | ||||| ||||| ||||| |||||
B73/7-4  GACGA CCCAC TGTCT CTGCG CATGC TAGGT ATGCA TGGgA CGGTG TATGC AAATT ATGCA
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||| | ||||| ||||| ||||| |||||
XI12/8A  GACGA CCCAC TGTCT CTGCG CATGC TAGGT ATGCA TGGCA CGGTG TATGC AAATT ATGCA

                1030        1040        1050        1060        1070        1080
XI12/8A  GTGGA TAAGG CCGAT CTGTT GCTTG CACTT GGTGT GCGGT TTGAT GATCG TGTGA CAGGG
W22/1A   GTGGA TAAGG CCGAT CTGTT GCTTG CACTT GGTGT GCGGT TTGAT GATCG TGTGA CAGGG
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||  |||| |||||
B73/7-4  GTGGA TAAGG CCGAT CTGTT GCTTG CACTT GGTGT GCGGT TTGAT GATCG cGTGA CAGGG
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||  |||| |||||
XI12/8A  GTGGA TAAGG CCGAT CTGTT GCTTG CACTT GGTGT GCGGT TTGAT GATCG TGTGA CAGGG

                1090        1100        1110        1120        1130        1140
XI12/8A  AAGAT TGAGG CTTTT GCAAG CAGGG CTAAG ATTGT GCACG TTGAT ATTGA TCCGG CTGAG
W22/1A   AAGAT TGAGG CTTTT GCAAG CAGGG CTAAG ATTGT GCACG TTGAT ATTGA TCCGG CTGAG
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
B73/7-4  AAGAT TGAGG CTTTT GCAAG CAGGG CTAAG ATTGT GCACG TTGAT ATTGA TCCGG CTGAG
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
XI12/8A  AAGAT TGAGG CTTTT GCAAG CAGGG CTAAG ATTGT GCACG TTGAT ATTGA TCCGG CTGAG

                1150        1160        1170        1180        1190        1200
XI12/8A  ATTGG CAAGA ACAAG CAGCC ACATG TGTCC ATCTG TGCAG ATGTT AAGCT TGCTT TGCAG
W22/1A   ATTGG CAAGA ACAAG CAGCC ACATG TGTCC ATCTG TGCAG ATGTT AAGCT TGCTT TGCAG
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
B73/7-4  ATTGG CAAGA ACAAG CAGCC ACATG TGTCC ATCTG TGCAG ATGTT AAGCT TGCTT TGCAG
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
XI12/8A  ATTGG CAAGA ACAAG CAGCC ACATG TGTCC ATCTG TGCAG ATGTT AAGCT TGCTT TGCAG

                1210        1220        1230        1240        1250        1260
XI12/8A  GGCAT GAATG CTCTT CTTGA AGGAA GCACA TCAAA GAAGA GCTTT GACTT TGGCT CATGG
W22/1A   GGCAT GAATG CTCTT CTTGA AGGAA GCACA TCAAA GAAGA GCTTT GACTT TGGCT CATGG
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
B73/7-4  GGCAT GAATG CTCTT CTTGA AGGAA GCACA TCAAA GAAGA GCTTT GACTT TGGCT CATGG
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
XI12/8A  GGCAT GAATG CTCTT CTTGA AGGAA GCACA TCAAA GAAGA GCTTT GACTT TGGCT CATGG

                1270        1280        1290        1300        1310        1320
XI12/8A  AACGA TGAGT TGGAT CAGCA GAAGA GGGAA TTCCC CCTTG GGTAT AAAAC ATCTA ATGAG
W22/1A   AACGA TGAGT TGGAT CAGCA GAAGA GGGAA TTCCC CCTTG GGTAT AAAAC ATCTA ATGAG
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
B73/7-4  AACGA TGAGT TGGAT CAGCA GAAGA GGGAA TTCCC CCTTG GGTAT AAAAC ATCTA ATGAG
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
XI12/8A  AACGA TGAGT TGGAT CAGCA GAAGA GGGAA TTCCC CCTTG GGTAT AAAAC ATCTA ATGAG

                1330        1340        1350        1360        1370        1380
XI12/8A  GAGAT CCAGC CACAA TATGC TATTC AGGTT CTTGA TGAGC TGACG AAAGG CGAGG CCATC
W22/1A   GAGAT CCAGC CACAA TATGC TATTC AGGTT CTTGA TGAGC TGACG AAAGG CGAGG CCATC
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
B73/7-4  GAGAT CCAGC CACAA TATGC TATTC AGGTT CTTGA TGAGC TGACG AAAGG CGAGG CCATC
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
XI12/8A  GAGAT CCAGC CACAA TATGC TATTC AGGTT CTTGA TGAGC TGACG AAAGG CGAGG CCATC

                1390        1400        1410        1420        1430        1440
XI12/8A  ATCGG CACAG GTGTT GGGCA GCACC AGATG TGGGC GGCAC AGTAC TACAC TTACA AGCGG
W22/1A   ATCGG CACAG GTGTT GGGCA GCACC AGATG TGGGC GGCAC AGTAC TACAC TTACA AGCGG
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
B73/7-4  ATCGG CACAG GTGTT GGGCA GCACC AGATG TGGGC GGCAC AGTAC TACAC TTACA AGCGG
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
XI12/8A  ATCGG CACAG GTGTT GGGCA GCACC AGATG TGGGC GGCAC AGTAC TACAC TTACA AGCGG

                1450        1460        1470        1480        1490        1500
XI12/8A  CCAAG GCAGT GGTTG TCTTC AGCTG GTCTT GGGGC TATGG GATTT GGTTT GCCGG CTGCT
W22/1A   CCAAG GCAGT GGTTG TCTTC AGCTG GTCTT GGGGC TATGG GATTT GGTTT GCCGG CTGCT
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
B73/7-4  CCAAG GCAGT GGTTG TCTTC AGCTG GTCTT GGGGC TATGG GATTT GGTTT GCCGG CTGCT
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
XI12/8A  CCAAG GCAGT GGTTG TCTTC AGCTG GTCTT GGGGC TATGG GATTT GGTTT GCCGG CTGCT

                1510        1520        1530        1540        1550        1560
XI12/8A  GCTGG TGCTT CTGTG GCCAA CCCAG GTGTT ACTGT TGTTG ACATC GATGG AGATG GTAGC
W22/1A   GCTGG TGCTT CTGTG GCaAA CCCAG GTGTT ACTGT TGTTG ACATC GATGG AGATG GTAGC
         ||||| ||||| ||||| ||||| ||||| ||||  ||||| ||||| ||||| ||||| ||||| |||||
B73/7-4  GCTGG TGCTT CTGTG GCaAA CCCAG GTGTc ACTGT TGTTG ACATC GATGG AGATG GTAGC
         ||||| ||||| ||||| || || ||||| ||||  ||||| ||||| ||||| ||||| ||||| |||||
XI12/8A  GCTGG TGCTT CTGTG GCCAA CCCAG GTGTT ACTGT TGTTG ACATC GATGG AGATG GTAGC

                1570        1580        1590        1600        1610        1620
XI12/8A  TTTCT CATGA ACGTT CAGGA GCTAG CTATG ATCCG AATTG AGAAC CTCCC GGTGA AGGTC
W22/1A   TTTCT CATGA ACGTT CAGGA GCTAG CTATG ATCCG AATTG AGAAC CTCCC GGTGA AGGTC
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||  |||| |||||
B73/7-4  TTTCT CATGA ACGTT CAGGA GCTAG CTATG ATCCG AATTG AGAAC CTCCC oGTGA AGGTC
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||  |||| |||||
XI12/8A  TTTCT CATGA ACGTT CAGGA GCTAG CTATG ATCCG AATTG AGAAC CTCCC GGTGA AGGTC

                1630        1640        1650        1660        1670        1680
XI12/8A  TTTGT GCTAA ACAAC CAGCA CCTGG GGATG GTGGT GCAGT GGGAG GACAG GTTCT ATAAG
W22/1A   TTTGT GCTAA ACAAC CAGCA CCTGG GGATG GTGGT GCAGT GGGAG GACAG GTTCT ATAAG
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
B73/7-4  TTTGT GCTAA ACAAC CAGCA CCTGG GGATG GTGGT GCAGT GGGAG GACAG GTTCT ATAAG
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
XI12/8A  TTTGT GCTAA ACAAC CAGCA CCTGG GGATG GTGGT GCAGT GGGAG GACAG GTTCT ATAAG

                1690        1700        1710        1720        1730        1740
XI12/8A  GCCAA CAGAG CGCAC ACATA CTTGG GAAAC CCAGA GAATG AAAGT GAGAT ATATC CAGAT
W22/1A   GCCAA CAGAG CGCAC ACATA CTTGG GAAAC CCAGA GAATG AAAGT GAGAT ATATC CAGAT
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
B73/7-4  GCCAA CAGAG CGCAC ACATA CTTGG GAAAC CCAGA GAATG AAAGT GAGAT ATATC CAGAT
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
XI12/8A  GCCAA CAGAG CGCAC ACATA CTTGG GAAAC CCAGA GAATG AAAGT GAGAT ATATC CAGAT

                1750        1760        1770        1780        1790        1800
XI12/8A  TTCGT GACGA TCGCC AAAGG GTTCA ACATT CCAGC GGTCC GTGTG ACAAA GAAGA ACGAA
W22/1A   TTCGT GACGA TCGCC AAAGG GTTCA ACATT CCAGC GGTCC GTGTG ACAAA GAAGA ACGAA
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
B73/7-4  TTCGT GACGA TCGCC AAAGG GTTCA ACATT CCAGC GGTCC GTGTG ACAAA GAAGA ACGAA
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
XI12/8A  TTCGT GACGA TCGCC AAAGG GTTCA ACATT CCAGC GGTCC GTGTG ACAAA GAAGA ACGAA

                1810        1820        1830        1840        1850        1860
XI12/8A  GTCCG CGCAG CGATA AAGAA GATGC TCGAG ACTCC AGGGC CGTAC CTCTT GGATA TAATC
W22/1A   GTCCG CGCAG CGATA AAGAA GATGC TCGAG ACTCC AGGGC CGTAC CTCTT GGATA TAATC
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
B73/7-4  GTCCG CGCAG CGATA AAGAA GATGC TCGAG ACTCC AGGGC CGTAC CTCTT GGATA TAATC
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
XI12/8A  GTCCG CGCAG CGATA AAGAA GATGC TCGAG ACTCC AGGGC CGTAC CTCTT GGATA TAATC

                1870        1880        1890        1900        1910        1920
XI12/8A  GTCCC ACACC AGGAG CATGT GTTGC CTATG ATCCC TAATG GTGGG GCTTT CAAGG ATATG
W22/1A   GTCCC ACACC AGGAG CATGT GTTGC CTATG ATCCC TAgTG GTGGG GCTTT CAAGG ATATG
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
B73/7-4  GTCCC ACACC AGGAG CATGT GTTGC CTATG ATCCC TAgTG GTGGG GCTTT CAAGG ATATG
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| || || ||||| ||||| ||||| |||||
XI12/8A  GTCCC ACACC AGGAG CATGT GTTGC CTATG ATCCC TAATG GTGGG GCTTT CAAGG ATATG

                1930        1940        1950        1960
XI12/8A  ATCCT GGATG GTGAT GGCAG GACTG TGTAC TGATC TAAAA TCCAG CAAG
W22/1A   ATCCT GGATG GTGAT GGCAG GACTG TGTAC TGATC TAAAA TCCAG CAAG
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||
B73/7-4  ATCCT GGATG GTGAT GGCAG GACTG TGTAC TGATC TAAAA TCCAG CAAG
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||
XI12/8A  ATCCT GGATG GTGAT GGCAG GACTG TGTAC TGATC TAAAA TCCAG CAAG

                  10          20          30          40          50          60
XI12/8A  MATAA AASTA LTGAT TAAPK ARRRA HLLAT RRALA APIRC SAASP AMPMA PPATP LRPWG
W22/1A   MATAA AASTA LTGAT TAAPK ARRRA HLLAT RRALA APIRC SAASP AMPMA PPATP LRPWG
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
B73/7-4  MATAA AASTA LTGAT TAAPK ARRRA HLLAT RRALA APIRC SAASP AMPMA PPATP LRPWG
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
XI12/8A  MATAA AASTA LTGAT TAAPK ARRRA HLLAT RRALA APIRC SAASP AMPMA PPATP LRPWG

                  70          80          90         100         110         120
XI12/8A  PTDPR KGADI LVESL ERCGV RDVFA YPGGA SMEIH QALTR SPVIA NHLFR HEQGE AFAAS
W22/1A   PTDPR KGADI LVESL ERCGV RDVFA YPGGA SMEIH QALTR SPVIA NHLFR HEQGE AFAAS
         || || ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
B73/7-4  PTePR KGADI LVESL ERCGV RDVFA YPGGA SMEIH QALTR SPVIA NHLFR HEQGE AFAAS
         || || ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
XI12/8A  PTDPR KGADI LVESL ERCGV RDVFA YPGGA SMEIH QALTR SPVIA NHLFR HEQGE AFAAS

                 130         140         150         160         170         180
XI12/8A  GYARS SGRVG VCIAT SGPGA TNLVS ALADA LLDSV PMVAI TGQVP RRMIG TDAFQ ETPIV
W22/1A   GYARS SGRVG VCIAT SGPGA TNLVS ALADA LLDSV PMVAI TGQVP RRMIG TDAFQ ETPIV
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
B73/7-4  GYARS SGRVG VCIAT SGPGA TNLVS ALADA LLDSV PMVAI TGQVP RRMIG TDAFQ ETPIV
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
XI12/8A  GYARS SGRVG VCIAT SGPGA TNLVS ALADA LLDSV PMVAI TGQVP RRMIG TDAFQ ETPIV

                 190         200         210         220         230         240
XI12/8A  EVTRS ITKHN YLVLD VDDIP RVVQE AFFLA SSGRP GPVLV DIPKD IQQQM AVPVW DKPMS
W22/1A   EVTRS ITKHN YLVLD VDDIP RVVQE AFFLA SSGRP GPVLV DIPKD IQQQM AVPVW DKPMS
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
B73/7-4  EVTRS ITKHN YLVLD VDDIP RVVQE AFFLA SSGRP GPVLV DIPKD IQQQM AVPVW DKPMS
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
XI12/8A  EVTRS ITKHN YLVLD VDDIP RVVQE AFFLA SSGRP GPVLV DIPKD IQQQM AVPVW DKPMS

                 250         260         270         280         290         300
XI12/8A  LPGYI ARLPK PPATE LLEQV LRLVG ESRRP VLYVG GGCAA SGEEL RRFVE LTGIP VTTTL
W22/1A   LPGYI ARLPK PPATE LLEQV LRLVG ESRRP VLYVG GGCAA SGEEL RRFVE LTGIP VTTTL
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
B73/7-4  LPGYI ARLPK PPATE LLEQV LRLVG ESRRP VLYVG GGCAA SGEEL RRFVE LTGIP VTTTL
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
XI12/8A  LPGYI ARLPK PPATE LLEQV LRLVG ESRRP VLYVG GGCAA SGEEL RRFVE LTGIP VTTTL

                 310         320         330         340         350         360
XI12/8A  MGLGN FPSDD PLSLR MLGMH GTVYA NYAVD KADLL LALGV RFDDR VTGKI EAFAS RAKIV
W22/1A   MGLGN FPSDD PLSLR MLGMH GTVYA NYAVD KADLL LALGV RFDDR VTGKI EAFAS RAKIV
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
B73/7-4  MGLGN FPSDD PLSLR MLGMH GTVYA NYAVD KADLL LALGV RFDDR VTGKI EAFAS RAKIV
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
XI12/8A  MGLGN FPSDD PLSLR MLGMH GTVYA NYAVD KADLL LALGV RFDDR VTGKI EAFAS RAKIV

                 370         380         390         400         410         420
XI12/8A  HVDID PAEIG KNKQP HVSIC ADVKL ALQGM NALLE GSTSK KSFDF GSWND ELDQQ KREFP
W22/1A   HVDID PAEIG KNKQP HVSIC ADVKL ALQGM NALLE GSTSK KSFDF GSWND ELDQQ KREFP
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
B73/7-4  HVDID PAEIG KNKQP HVSIC ADVKL ALQGM NALLE GSTSK KSFDF GSWND ELDQQ KREFP
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
XI12/8A  HVDID PAEIG KNKQP HVSIC ADVKL ALQGM NALLE GSTSK KSFDF GSWND ELDQQ KREFP

                 430         440         450         460         470         480
XI12/8A  LGYKT SNEEI QPQYA IQVLD ELTKG EAIIG TGVGQ HQMWA AQYYT YKRPR QWLSS AGLGA
W22/1A   LGYKT SNEEI QPQYA IQVLD ELTKG EAIIG TGVGQ HQMWA AQYYT YKRPR QWLSS AGLGA
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
B73/7-4  LGYKT SNEEI QPQYA IQVLD ELTKG EAIIG TGVGQ HQMWA AQYYT YKRPR QWLSS AGLGA
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
XI12/8A  LGYKT SNEEI QPQYA IQVLD ELTKG EAIIG TGVGQ HQMWA AQYYT YKRPR QWLSS AGLGA

                 490         500         510         520         530         540
XI12/8A  MGFGL PAAAG ASVAN PGVTV VDIDG DGSFL MNVQE LAMIR IENLP VKVFV LNNQH LGMVV
W22/1A   MGFGL PAAAG ASVAN PGVTV VDIDG DGSFL MNVQE LAMIR IENLP VKVFV LNNQH LGMVV
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
B73/7-4  MGFGL PAAAG ASVAN PGVTV VDIDG DGSFL MNVQE LAMIR IENLP VKVFV LNNQH LGMVV
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
XI12/8A  MGFGL PAAAG ASVAN PGVTV VDIDG DGSFL MNVQE LAMIR IENLP VKVFV LNNQH LGMVV

                 550         560         570         580         590         600
XI12/8A  QWEDR FYKAN RAHTY LGNPE NESEI YPDFV TIAKG FNIPA VRVTK KNEVR AAIKK MLETP
W22/1A   QWEDR FYKAN RAHTY LGNPE NESEI YPDFV TIAKG FNIPA VRVTK KNEVR AAIKK MLETP
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
B73/7-4  QWEDR FYKAN RAHTY LGNPE NESEI YPDFV TIAKG FNIPA VRVTK KNEVR AAIKK MLETP
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||||
XI12/8A  QWEDR FYKAN RAHTY LGNPE NESEI YPDFV TIAKG FNIPA VRVTK KNEVR AAIKK MLETP

                 610         620         630
XI12/8A  GPYLL DIIVP HQEHV LPMIP NGGAF KDMIL DGDGR TVY*
W22/1A   GPYLL DIIVP HQEHV LPMIP sGGAF KDMIL DGDGR TVY*
         ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||
B73/7-4  GPYLL DIIVP HQEHV LPMIP sGGAF KDMIL DGDGR TVY*
         ||||| ||||| ||||| |||||  |||| ||||| ||||| |||
XI12/8A  GPYLL DIIVP HQEHV LPMIP NGGAF KDMIL DGDGR TVY*